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COVALENT JOINING OF DNA STRANDS TO RNA STRANDS 
CATALYZED BY VACCINIA TOPOISOMERASE 

5 This application claims the benefit of. copending U.S. 

Provisional Application Serial No. 60/049,405, filed June 
12, 1997. 

This invention was made with support under Grant 
10 No. GM46330 from the National Institutes of Health, . U.S. 

Department of Health and Human Services. Accordingly, the 
United States Government has certain rights in the 
invention. 

15 Throughout this application, various references are 

referred to within parentheses. Disclosures of these 
publications in their entireties are hereby incorporated by 
reference into this application to more fully describe the 
state of the art to which this invention pertains. Full 

20 bibliographic citations for these references may be found 

at the end of this application, preceding the sequence 
listing and claims. 

Background of the Invention 
25 Vaccinia topoisomerase binds duplex DNA and forms a 

covalent DNA- (3 * -phosphotyrosyl ) -protein adduct at the 
sequence B'-CCCTT*. The enzyme reacts readily with a 36-mer 
CCCTT strand ( DNA - p - RNA ) composed of DNA 5' and RNA 3' of 
the scissile bond. However, a 36-mer composed of RNA 5' and 
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DNA 3* of the scissile phosphate (RNA-p-DNA) is a poor 
substrate for covalent adduct formation. Vaccinia 
topoisomerase efficiently transfers covalently held CCCTT- 
containing DNA to 5 ' -OH terminated RNA acceptors; the 
5 topoisomerase can therefore be used to tag the 5' end of 

RNA in vitro. 

Religation of the covalently bound CCCTT- containing DNA 
strand to a 5 1 -OH terminated DNA acceptor is efficient and 
rapid (/c,- el > 0.5 sec""), provided that the acceptor DNA is 
capable of base-pairing to the noncleaved DNA strand of the 
topoisomerase -DNA donor complex. The rate of strand 
transfer to DNA is not detectably affected by base 
mismatches at the 5' nucleotide of the acceptor strand. 
Nucleotide deletions and insertions at the 5' end of the 
acceptor slow the rate of religation; the observed 
hierarchy of reaction rates is: +1 insertion > -1 deletion 
> +2 insertion >> -2 deletion. These findings underscore 
the importance of a properly positioned 5' OH terminus in 
transesterif icat ion reaction chemistry, but also raise the 
possibility that topoisomerase may generate mutations by 
sealing DNA molecules with mispaired or unpaired ends. 

Vaccinia topoisomerase, a 3 14 -amino acid eukaryotic type I 
25 enzyme, binds and cleaves duplex DNA at a specific target 

sequence 5 1 - (T/C) CCTT 1 (1-3). Cleavage is a 
transesterif ication reaction in which the Tp'N 
phosphodiester is attacked by Tyr-274 of the enzyme, 
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resulting in the formation of a DNA- (3 ' -phosphotyrosy 1 ) 
protein adduct (4). The covalently bound topoisomerase 
catalyzes a variety of DNA strand transfer reactions. It 
can religate the CCCTT-containing strand across the same 
bond originally cleaved (as occurs during the relaxation of 
supercoiled DNA) or it can ligate the strand to a 
heterologous acceptor DNA 5' end, thereby creating a 
recombinant molecule (5-7) . 

Duplex DNA substrates containing a single CCCTT target site 
have been used to dissect the cleavage and strand transfer 
steps. A cleavage-religation equilibrium is established 
when topoisomerase transesterif ies to DNA ligands 
containing £l8-bp of duplex DNA 3 ' of the cleavage site (8- 
11) . The reaction is in equilibrium because the 5 ' -OH 
terminated distal segment of the scissile strand remains 
poised near the active site by virtue of the fact that it 
is stably base-paired with the nonscissile strand . About 
20% of the CCCTT-containing strand is covalently bound at 
equilibrium (11) . "Suicide" cleavage occurs when the CCCTT- 
containing substrate contains no more than fifteen base 
pairs 3' of the scissile bond, because the short leaving 
strand dissociates from the protein-DNA complex. In enzyme 
excess, >90% of the suicide substrate is cleaved (11). 

The suicide intermediate can transfer the incised CCCTT 
strand to a DNA acceptor. Intramolecular strand transfer 
occurs when the 5 ' -OH end of the noncleaved strand of the 



suicide intermediate attacks the 3* phosphotyrosyl bond and 
expels Tyr-274 as the leaving group. This results in 
formation of a hairpin DNA loop (5) . Intermolecular 
religation occurs when the ■ suicide intermediate is provided 
with an exogenous 5 1 -OH terminated acceptor strand, the 
sequence of which is complementary to the single strand 
tail of the noncleaved strand in the immediate vicinity of 
the scissile phosphate (5) . In the absence of an acceptor 
strand, the topoisomerase can transfer the CCCTT strand to 
water, . releasing a 3 ' -phosphate- terminated hydrolysis 
produce, or to glycerol, releasing a 3 ' -phosphoglycerol 
derivative (.12) . Although the hydrolysis and glycerololysis 
reactions are much slower than religation to a DNA acceptor 
strand, the extent of strand transfer to non-DNA 
nucleophiles can be* as high as 15-40%. 

The specificity of vaccinia topoisomerase in DNA cleavage 
and its versatility in strand transfer have inspired 
topoisomerase-based strategies for polynucleotide synthesis 
in which DNA oligonucleotides containing CCCTT cleavage 
sites serve as activated linkers for the joining of other 
DNA molecules with compatible termini (13). The present 
study examines the ability of the vaccinia topoisomerase to 
cleave and rejoin RNA-containing polynucleotides. It was 
shown previously that the enzyme did not bind covalently to 
CCCTT-containing molecules in which either the scissile 
strand or the complementary strand was composed entirely of 
RNA (S)-. To further explore the pentose sugar specificity 
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of the enzyme, we have prepared synthetic CCCTT-containing 
substrates in which the scissile strand is composed of DNA- 
and RNA-cohtaining halves. In this way, we show that the 
enzyme is indifferent to RNA downstream of the scissile 
phosphate, but is does not form the covalent complex when 
the region 5* of the scissile phosphate is in RNA form. 
Also assessed is the contribution of base -pairing by the 5' 
end of the acceptor strand to the rate of the DNA strand 
transfer reaction. 

Summary of the Invention 

The present invention provides a method of covalently 
joining a DNA strand to an RNA strand comprising (a) 
forming a topoisomerase -DNA intermediate . by incubating a 
DNA cleavage substrate comprising a topoisomerase cleavage 
site with a topoisomerase specific for that site, wherein 
the topoisomerase -DNA intermediate has one or more 5' 
single-strand tails; and (b) adding to the topoisomerase- 
DNA intermediate an acceptor RNA strand complementary to 
the 5' single-strand tail under conditions permitting a 
ligation of the 5' single-strand tail of the topoisomerase- 
DNA intermediate to the RNA acceptor strand and 
dissociation of the topoisomerase, thereby covalently 
joining the DNA strand to the RNA strand. The DNA cleavage 
substrate may be created by hybridizing a DNA strand having 
a topoisomerase cleavage site to one or more complementary 
DNA strands, thereby forming a DNA cleavage substrate 
having a topoisomerase cleavage site and a oligonucleotide 
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leaving group located 3' of a scissile bond or may be a 
plasmid vector comprising a topoisomerase cleavage site. 

The present invention also provides a covalent. 
topoisomerase-DNA intermediate having a 5' single-strand 
tail. 

Another aspect of the present invention provides a DNA-RNA 
molecule covalently joined by topoisomerase catalysis. 

The present invention provides a covalently joined DNA-RNA 
molecule having a labeled 5' end. 

The present invention further provides a method of tagging 
a 5* end of an RNA molecule comprising: (a) forming a 
topoisomerase -DNA intermediate by incubating a DNA cleavage 
substrate comprising a topoisomerase cleavage site with a 
topoisomerase specific for that site, wherein the 

... topoisomerase-DNA intermediate has one or mor^: 5' single- 
strand tails; and (b) adding to the topoisomerase-DNA 
intermediate a 5' -hydroxy 1 terminated RNA molecule 
complementary to the 5' single-strand tail under conditions 

< permitting a ligation of the covalently bound DNA strand of 
the topoisomerase-DNA intermediate to the RNA molecule and 
dissociation of the topoisomerase, thereby forming a 5* end 
tagged DNA-RNA ligation product. The DNA cleavage 

substrate can be created, for example, by hybridizing a DNA 
strand having a topoisomerase cleavage site to a 
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complementary DNA strand, thereby forming a DNA cleavage 
substrate having a topoisomerase cleavage site and a 
oligonucleotide leaving group located 3* of a scissile 
bond . 

5 

Another aspect of the present invention provides a 5* end 
tagged RNA molecule. 

In another aspect the present invention also provides a 
10 DNA -RNA molecule which has been joined in vitro by the use 

of a topoisomerase. 

The present invention further provides a method of tagging 
a 5 1 end of a capped messenger RNA comprising: 

15 a) isolating mRNA from cells or a tissue; b) removing an 

RNA cap structure from the isolated mRNA, resulting in a 
de -capped RNA; c) dephosphorylat ing the de-capped RNA, 
thereby forming a de- capped and dephosphorylated RNA; 
d) constructing a DNA cleavage substrate for topoisomerase 

20 having a topoisomerase cleavage site and a complementary 

strand, the complementary strand having a mixed or random 
base composition downstream of the topoisomerase cleavage 
site, the DNA cleavage substrate being designated as a DNA- 
(N), substrate; e) cleaving the DNA- (N) substrate with a 

25 topoisomerase, .thereby forming a covalent topoisomerase - 

DNA - ( N ) M complex containing a 5' tail of mixed or random 
base composition on a nohcleaved strand; and f) incubating 
the cleaved covalent topoisomerase-DNA- (N) M complex with 
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the de- capped and dephosphorylated RNA formed in step (c) 
to form a 5' DNA-tagged DNA-RNA ligation product. 

As used herein the number of bases (N) of the DNA cleavage 
substrate, designated supra as a DNA- (N) substrate , may be 
from one to four bases long. 

The present invention also provides a method of isolating 
and cloning a capped mRNA after subtraction of non-capped 
RNA comprising: a) isolating mRNA from cells or a tissue; 
b) dephosphorylating the mRNA; c) incubating a cleaved 
topoisomerase-BioDNA- (N) complex with the dephosphorylated 
mRNA to form a 5 f BioDNA- tagged DNA-RNA ligation product; 
d) removing the 5' BioDNA- tagged DNA-RNA ligation product 
and any unreacted cleaved topoisomerase-BioDNA- (N) complex 
by adsorption to streptavidin and recovering any 
nonadsorbed material, said material being enriched for RNA 
having a capped 5' end and being resistant to 
. dephosphorylation in step (b) , thereby being unable to 
react with the cleaved topoisomerase-BioDNA- (N) complex; e) 
removing of the 5 1 end cap from the enriched RNA recovered 
from the nonadsorbed material in step (d) ; f ) 
dephosphorylating the de-capped RNA, thereby forming a de- 
'capped and dephosphorylated RNA; g) incubating a cleaved 
topoisomerase-BioDNA- (N) complex with the de-capped and 
dephosphorylated RNA to form a 5' BioDNA- tagged DNA-RNA 
ligation product; h) affinity purifying the 5* DNA- tagged 
DNA-RNA ligation product; and i) PGR amplification of the 
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decapped and dephosphorylated RNA of the DNA-RNA ligation 
product using a sense primer corresponding to a scissile 
strand of the topoisomerase cleavage substrate 5' of the 
site of cleavage and an antisense primer, said antisense 
5 primer being complementary to either a 3' poly (A) tail or 

to an internal RNA sequence . 

The present invention also provides a method of obtaining 
full-length gene sequences comprising attaching a DNA tag 
10 to an isolated mRNA sequence, and using the DNA-tagged mRNA 

as a template for DNA synthesis. DNA may be further 
inserted into an expression vector and used to express 
recombinant protein. 

Brief Description q£ the Figures 

Figure 1A-B. Topoisomerase cleavage of DNA-p-RNA and 

RNA-p-DNA strands, (A) The 36 -bp substrate used in the 
cleavage reactions is shown, with the -"P-labeled scissile 
phosphate indicated by the filled circle. The segments of 
the top strand flanking the scissile phosphate, which are 
either DNA or RNA, are bracketed; the bottom strand is all- 
DNA. (B). Reaction mixtures (20 fxl) containing 50 mM Tris- 
HC1 (pH 8.0), 0.2 pmol of substrate (either DNA- p- RNA or 
RNA-p-DNA) and topoisomerase as indicated were incubated at 
37°C for 10 min. Covalent adduct formation (% of input 
label transferred to the topoisomerase) is plotted as a 
function of the amount of enzyme added. 
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Figure 2A-B. Kinetics of cleavage of RNA-containing 
36-mer substrates. Reaction mixtures contained (per 20 M l) 
50 mM Tris-HCl (pH 8.0), 0 . 2 pmol of radiolabeled 36-mer 
substrate and 1 pmo l of topoisomerase . Covalent adduct 
formation (% Q f input label transferred to the 
topoisomerase) is plotted as a function of the time of 
incubation at 37°C. (A) Cleavage of DNA-p-DNA and DNA-p-RNA; 
x-axis in sec. (B) Cleavage of RNA-p-DNA; x-axis.in min . 

Figure 3A-B. Strand transfer tQ an ^ acceptor- 

(A) The structures of the covalent topoisomerase -DNA 
complex (suicide intermediate) and the 18-mer acceptor 
strands (DNA or RNA) are shown. (B ) Religation reactions 
were performed under single- turnover conditions as 
described under Materials and Methods. The extent of 
religation (expressed as the percent of input labeled DNA 
converted to the 30-mer s.trand transfer product) is plotted 
as a function of incubation time. 

F±9Ure 4 - Analysis of the strand transfer 

reaction products. Reaction mixtures (20 M l) containing 50 
mM Tris-HCl (p H 8.0), 0 . 5 p mo l of 5 ' -libeled suicide DNA 
cleavage substrate, and 2.5 pmol of topoisomerase were 
incubated at 37°C for 10 min. Strand transfer was then 
initiated by adding a 50-fold excess of the acceptor DNA 
(18-mer D; lanes 1 and 2) or acceptor RNA (18-mer R ; lanes 
5 and 6), while simultaneously adjusting the mixtures to 
0.3 M NaCl. The religation reactions were quenched after a 
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10 min incubation by adding SDS to 0.2%. The samples were 
extracted with phenol/chloroform and ethanol -precipitated . 
The pellets were resuspended in either 12 /xl of 0 . 1M NaOH, 
1 mM EDTA (NaOH + ) or 12 fil of 10 mM Tris-HCl (pH 8.0), 1 
mM EDTA (NaOH -) . These samples were incubated at 37°C for 
16 h. Control samples containing the input 18-mer DNA 
substrate that had not been exposed to topoisomerase were 
treated in parallel (lanes 3 and 4). The alkali - treated 
samples were neutralized by adding 1.2 /zl of 1 M HCl . All 
samples were then ethanol -precipitated , resuspended in 
formamide, heated for 5 min at 95°C, and then 
electrophoresed through a 17% polyacrylamide gel containing 
7 M urea in TBE. An autoradiography of the. gel is shown. The 
positions of the 30-mer religation product and the 18-mer 
input strand are indicated at the left. Alkaline hydrolysis 
. of the RNA strand transfer reaction product (lane 6) 
yielded a discrete species denoted by the asterisk. 

Figure 5A-B. 5' DNA-tagging of RNA transcribed by T3 

RNA polymerase. (A) The structures of the covalent 
topoisomerase-DNA donor complex and the- RNA acceptor are 
shown. The 5 f single- strand tail of the suicide 
intermediate is complementary to the 18 nucleotides at the 
5' end of the T3 transcript. Reaction mixtures contained 
(per 15 pi) 5° Tris-HCl (pH 8.0), 0.3 M NaCl , and 0.1 

pmol of. 3 'P-GMP- labeled T3 transcript. (B) Religation was 
initiated by the addition of pre- formed topoisomerase-DNA 
donor (at a 10-fold molar excess over RNA acceptor). 
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Incubation was at 37°C. Aliquots (15 ^1) were removed at the 
times indicated and quenched immediately by adding SDS and 
EDTA . The samples were adjusted to 50% formamide, heated 
for 5 min at 95°C, and elect rophoresed through a 12% 
polyacrylamide gel containing 7 M urea in TBE . Transfer of 
the 12-nucleotide DNA donor strand to the 5 ' end of the 
labeled 36-mer T3 transcript yielded a labeled 48-mer 
product. Conversion of input 36-mer to 48-mer was 
quantitated by scanning the gel with a phosphor imager . 

Figure 6A-C. Kinetics of topoisomerase-catalyzed 

strand transfer reactions resulting in DNA deletions and 
insertions. ( A ) The structure of the pre- formed donor 
complex is shown at the top of the Figure. Religation 
reactions were performed under single- turnover conditions 
as described under Materials and Methods. All DNA acceptors 
were included at a 50-fold molar excess over the input 
CCCTT-containing substrate. (B) Deletion formation. The 
structures of the completely base-paired 18-mer acceptor 
DNA oligonucleotide (open circle) , a 17-mer oligonucleotide 
that anneals to the donor complex to leave a 1-nucleotide 
gap (filled square) and a 16-mer strand that anneals to 
leave a 2 -nucleotide gap (square) are shown. (C) Insertion 
formation. The structures of the completely base-paired 18- 
mer acceptor (open circle,, a 19 - mer oligonucleotide 
containing l extra 5' nucleotide (filled triangle) and a 
20-mer acceptor containing 2 extra 5' nucleotides 
(triangle), are shown. The extent of religation is plotted 
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• as a function of incubation time. 

Figure 7 ' Analysis of deleted and inserted DNA 

strand transfer products. Religation to acceptors with 
recessed and protruding 5- ends was performed as described 
in the legend to Fig. 6 . The reaction products were, 
analyzed by electrophoresis through a 17% polyacrylamide 
gel containing 7 M urea in TBE . An autoradiograph of the 
gel is shown. The acceptor strands were as follows: no 
acceptor (lane 2); perfectly paired 18-mer (lanes 3 and 8); 
17-mer with a 1-nucleotide gap (lane 4); 16-mer with a 2- 
nucleotide gap (lane 5); 19-tner with a 1-nucleotide insert 
(lane 6); 20-mer with a 2 -nucleotide insert (lane 7). 
Control samples containing the 5.' -labeled 18-mer scissile 
strand but no topoisbmerase were analyzed in lanes 1 and 9 . 

Figure 8, Strand transfer to DNA acceptors 

containing a single 5 • base mismatch. Religation reactions 
were performed under single- turnover conditions as 
described under Materials and Methods. All DNA acceptors 
were included at a 50 -fold molar excess over the input 
CCCTT-containing substrate. The structures of the fully 
complementary 18-mer and the three terminal -nucleotide 
variants are shown. 

Figure 9A-B. Kinetics of intramolecular hairpin 

formation. (A) Hairpin formation without potential for 
base -pairing. DNA cleavage substrates were prepared by 
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annealing the 5- ^-labeled 18-mer scissile stran d to a 30- 
mer co mp l em entary strand (filled circle) or an 18-mer 
complementary strand (circle) ; the structures of the 
substrates are shown with the topoisomerase cleavage sites 
indicated by arrows. Reaction mixtures containing (per 20 
Ml) 50 mM Tris HC1 ( pH 7.5), 0.5 pmol of DNA substrate, and 
1 pmol of topoisomerase were incubated at 37°C for 10 min. 
The mixtures were then adjusted to 0.3 M Nad. Aliquots (20 
-/il) were withdrawn immediately prior to adding salt (time 
zero) and at various intervals after adding salt; the 
reactions were quenched immediately by adding an equal 
volume of stop solution (1% SDS, 95% formamide, 20 mM 
EDTA), The samples were heat -denatured and electrophoresed 
through a 17% polyacrylamide gel containing 7 M urea in 
TBE. The extent of intramolecular strand transfer 
(expressed as percent of the input labeled substrate 
converted to hairpin product) is plotted as a function of 
time after addition of NaCl. (B) Hairpin formation with 
potential for base-pairing. The structure of the 18-mer/30- 
mer cleavage substrate is shown, with the topoisomerase 
cleavage site indicated by an arrow. A reaction mixture 
containing (per 20 M l) 50 mM Tris HC1 ( P H 7.5), 0.5 pmol of 
DNA substrate, and 1 pmol of topoisomerase was incubated at 
37°C for 2 min. The mixtures were then adjusted to 0.3 M 
NaCl. Aliquots (20 M l) were withdrawn immediately prior to 
adding- salt (time zero) and at various intervals after 
adding salt. The extent of intramolecular strand transfer 
is plotted asa function of time after addition of NaCl . 
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Figure 10A-B. Affinity Tagging of RNA Using Vaccinia 

Topoisomerase . (A) The strand transfer reaction pathway is 
diagramed in the Figure. The biotinylated DNA substrate 
which contains a single topoisomerase recognition site is 
5 immobilized on the Dynabeads (Dynal) streptavidin solid 

support. The biotin moiety (indicated by the black square) 
is introduced at the 5 f end of the CCCTT-coritaining strand 
via standard . protocols for automated oligonucleotide 
synthesis. The purified vaccinia topoisomerase is reacted 
10 ■ with the bead-bound DNA to form a covalent enzyme -DNA donor 

complex, as illustrated. Enzyme not bound to DNA is 
removed by washing the beads with buffer. The strand 
transfer reaction is initiated by addition of the [ 32 P] - CMP 
labeled T7 transcript which is dephosphorylated by prior 
15 treatment with alkaline phosphatase. The 5' single-strand 

tail of the donor complex is complementary to the 12 
nucleotides at the 5' end of the T7 transcript. Religation 
of the covalently held biotinylated DNA strand to the T7 
transcript is observed as conversion of the 30-mer RNA to 
20 a product of- 50 nucleotides. The mixture was incubated at 

3 7°C for 15 min. The beads were then recovered, by 
centrif ugation , washed, and resuspended in 20 ptl of buffet- 
containing 0.8% SDS and 80% f ormamide . The samples were 
heated at 95°C for 5 min, centrifuged for 5 min, then the 
25 supernatants were electrophoresed through a 12% 

polyacrylamide gel containing 7M urea in TBE buffer. (B) 
An autoradiograph of the gel is shown in the Figure. Lane 
B (Bound) - product of the strand transfer reaction bound 



MSDOCID: <WO 98S6943A1_IB> 



WO 98/56943 PCT/US98/12372 

-16- 

to the Dynabeads; lane F (Free) - supernatant from the 
strand transfer reaction. The positions of the input 30- 
mer T7 transcript and the 50-mer product are shown . at the 
right . 

5 . 

Figure 11. A schematic representation of a method 

of using DNA-tagged mRNA to obtain full-length gene 
■sequences. Briefly, capped full-length mRNA is isolated by 
attachment to a solid support, such as by using 

10 biotinylated-capped mRNA bound to a . magnetic bead 

conjugated with s treptavidin . The isolated mRNA is 
decapped (using tobacco acid pyrophosphatase) and 
dephosphorylated (using alkaline phosphatase) then modified 
with a DNA tag using the methods outlined below. The DNA- 

15 tagged mRNA is used to generate first strand cDNA using 

reverse transcriptase and amplified using PCR.. The 
amplified cDNA is then inserted into a plasmid vector. 

Detailed Description of the Invention 

20 Throughout this application, the following standard 

x . 

abbreviations are used to indicate specific nucleotides: 
C=cytosine A=adenosine U=uracil 

T=thymidine G-guanosine 

25 The- present invention provides a method of covalently 

joining a DNA strand to an RNA strand comprising (a) 
forming a topoisomerase-DNA intermediate by incubating a 
DNA cleavage substrate comprising a topoisomerase cleavage 
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site with a topoisomerase specific for that site, wherein 
the topoisomerase -DNA intermediate has one or more 5' 
single-strand tails; and (b) adding to the topoisomerase- 
DNA intermediate an acceptor RNA strand complementary to 
5 the 5- single-strand tail under conditions permitting a 

ligation of the covalently bound DNA strand of the 
topoisomerase-DNA intermediate to the RNA acceptor strand 
and dissociation of the topoisomerase, thereby covalently 
joining the DNA strand to the RNA strand. The DNA cleavage 
10 • substrate may be created by hybridizing a DNA strand having 
a topoisomerase cleavage site to one or more complementary 
DNA strands, thereby forming a DNA cleavage substrate 
having a topoisomerase cleavage site and a oligonucleotide 
leaving group located 3' of a scissile bond or may be a 
15 plasmid vector comprising a topoisomerase cleavage site. 

In an embodiment of the above -described method, the 
topoisomerase cleavage site is a sequence comprising CCCTT . 
in a preferred embodiment the topoisomerase is a vaccinia 
topoisomerase enzyme. In a further embodiment the vaccinia 
20 topoisomerase enzyme is a modified vaccinia topoisomerase 

enzyme. In another embodiment the DNA strand having a 
topoisomerase cleavage site is radiolabeled . In a 
preferred embodiment the radiolabel is 32 P or a 
radiohalogen. Means for radio labeling nucleotides are 
25 well known in the art (see Ausubel, et. al . , Short 

Protocols in Molecular Biology, 3rd ed . , Wiley, 1995; US 
patent 5.746,997 issued 05/05/98). In another preferred 



MSDOC1D: <WO 9856943At_IB> 



WO 98/56943 PCT/US98/12372 

-18- 

embodiment the DNA strand having a topoisomerase cleavage 
site is labeled with a biotin moiety or another, affinity 
purification tag such as chitin binding domain, 
glutathione-S- transferase, and the like. Methods of adding 
affinity labels to nucleotides are well known in the art 
(see Carniaci, et . al . , Genomics 37: 327-336,1996 ; Ausubel, 
et. al . , supra). In an embodiment the topoisomerase -bound 
DNA intermediate and the acceptor RNA strand are ligated in 
vitro . 

The present invention provides a covalent topoisomerase -DNA 
intermediate molecule having a 5' single - strand tail. In 
an embodiment of the covalent . topoisomerase-DNA 
intermediate molecule, the 5' single - strand tail comprises 
a specific sequence. In another embodiment the covalent 
topoisomerase -DNA intermediate molecule having a 5' single- 
strand tail is generated by the above-described method of 
covalently joining a DNA strand to an RNA strand. In a 
further embodiment of the covalent topoisomerase -DNA 
^intermediate molecule having 5* . single - strand tail 
generated by the above -described method of the 5' single- 
strand tail comprises a specific sequence. In another 
embodiment of the covalent topoisomerase -DNA intermediate 
molecule having a 5' single-strand tail . generated by step 
(a) of the above -described method the DNA strand is 
radiolabelled .. In a preferred embodiment of the covalent 
topoisomerase -DNA intermediate molecule the radiolabel is 
3<i P or a radiohalogen . In another embodiment of the 



PCT/US98/12372 

WO 98/56943 

19- 

covalent topoisomerase-DNA intermediate molecule having a 
5- single-strand tail generated by step (a) of the above- 
described method, the DNA strand is affinity labeled. In 
a preferred embodiment of the covalent topoisomerase-DNA 
5 intermediate molecule, wherein the affinity label is a 

biotin moiety, a chitin binding domain, a glutathione - S - 
transferase moiety, and the like. 

The present invention further provides a DNA-RNA molecule 
covalently joined by topoisomerase catalysis. 

10 The present invention provides a DNA-RNA molecule 

covalently joined by the above-described method of 
covalently joining a DNA strand to an RNA strand. In a 
preferred embodiment the covalently joined DNA-RNA molecule 
has a 5' end label. In a further embodiment the 5' end 

15 label is 32 P or a radiohalogen . In another embodiment the 

5- end label is a biotin moiety, a chitin binding domain, 
a glutathione-S-transf erase moiety, and the like. 

The present invention provides a covalently joined DNA-RNA 
molecule having a labeled 5' end. In a preferred 
20 embodiment of the covalently joined DNA-RNA molecule the 

5- end label is 32 P or a radiohalogen. In another preferred 
embodiment of the covalently joined DNA-RNA molecule the 
5- end label is a biotin moiety, a Chitin binding domain, 
a glutathione-S-transf erase moiety, and the like. 

25 The present invention further provides a method of tagging 

a 5' end of an RNA molecule comprising: (a) forming a 
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topoisomerase-DNA intermediate by incubating a DNA cleavage 
substrate comprising a topoisomerase cleavage site with a 
topoisomerase specific for that site, wherein the 
topoisomerase-DNA intermediate has one or more 5* single- 
strand tails; and (b) adding to the topoisomerase-DNA 
intermediate a 5' -hydroxy 1 terminated RNA molecule 
complementary to the 5' single-strand tail under conditions 
permitting a ligation of the 5' single - strand tail of the 
topoisomerase-DNA intermediate to the RNA molecule and 
dissociation of the topoisomerase, thereby forming a 5' end 
tagged DNA -RNA ligation product. The DNA cleavage 

substrate can be created, for example, by hybridizing a DNA 
strand having a topoisomerase cleavage site to a 
complementary DNA strand, thereby forming a DNA cleavage 
substrate having a topoisomerase cleavage site and a 
oligonucleotide leaving group located 3' of a scissile 
bond . 

The RNA molecule can be the product of in vitro synthesis 
■or can have been isolated from cells or tissues. Methods 
of synthesizing RNA In vitro are well known in the art 
(see, for example, Ausubel, et al, supra). Methods of 
isolating RNA from cells and/or tissues are also well known 
in the art (see, Ausubel, et al, supra). Cells and tissues 
suitable for use in obtaining RNA useful in the practice of 
the present invention include both animal cells and plant 
cells. Particularly preferred cells include mammalian 
cells (such as rodent cells, primate cells, and the like) 
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and insect cells. RNA may also be isolated from 
prokaryotic cells such as bacteria. 

In a preferred embodiment of the above -described method, 
the RNA molecule is a dephosphorylated after synthesis or 
5 isolation. In another preferred embodiment the 

dephosphorylation is achieved by treatment of the RNA 
molecule with alkaline phosphatase. In a preferred 
embodiment the topoisomerase is a vaccinia topoisomerase 
enzyme. In another embodiment the vaccinia topoisomerase 
10 enzyme is a modified vaccinia topoisomerase enzyme. In a 

preferred embodiment the cleavage site- comprises CCCTT . In 
another preferred embodiment the method further comprises 
introducing a biotin moiety or another affinity 
purification moiety, to the DNA cleavage substrate prior to 
15 step (a) . In still another preferred embodiment the method 

further comprises immobilizing the affinity purification 
tagged DNA cleavage substrate on a solid support prior to 
step (a). In a preferred embodiment the solid support ■ is 
a sepharose resin or magnetic beads having an affinity 
20 purification material, such as avidin, streptavidin , 

chitin, glutathione and the like, bound thereto. Methods 
of preparing such materials are well known in the art. In 
yet another preferred embodiment the method further 
comprises purifying a biotinylated 5- end tagged DNA - RNA 
25 ligation product by separating the solid support to which 

the biotinylated 5' end tagged DNA -RNA ligation product is 
immobilized from a liquid phase comprising unmodified RNA. 
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In a preferred embodiment the 5' end of the DNA cleavage 
substrate is affinity labeled. In a preferred embodiment 
the affinity label is a biotin moiety. In another 
preferred embodiment the method further comprises 
immobilizing the biotinylated 5' end affinity labeled DNA 
cleavage substrate on a solid support. In a preferred 
embodiment the solid support is modified with streptavidin . 
In another preferred embodiment the method further 
comprises purifying the biotinylated 5' end affinity 
labeled DNA-RNA ligation product by separating the 
streptavidin-modif ied solid support to which the 5' end 
tagged DNA-RNA ligation product, is immobilized from a 
liquid phase comprising unmodified RNA. 

As used herein, unmodified RNA is defined as an RNA strand 
or strands which have not been joined covalently to a DNA 
strand. 

The present invention provides a 5' end tagged RNA 
molecule. In a preferred embodiment of the 5' end tagged 
RNA molecule, the tag is a DNA sequence. In a further 
preferred embodiment the 5 ' end tagged RNA molecule further 
comprising a 5' end label. In an embodiment the 5' end 
label is 3 ~P or a radiohalogen . In another embodiment the 
5' end label is a biotin moiety or another affinity 
purification moiety. 

In an embodiment the 5' end tagging RNA molecule is 
generated by the above -described method of tagging a 5' end 
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of an RNA molecule. In an embodiment the 5' end tagged RNA 
molecule further comprises a 5' end label. In a further 
embodiment the 5' end label is 32 P. In another embodiment 
the 5* end label is a biotin moiety. 

In another aspect the present invention further provides a 
DNA-RNA molecule which has been joined in vjtXQ by the use 
of a topoisomerase . 

As used herein the number of nucleotides (N) of the DNA 
cleavage substrate, designated supra as a DNA- (N) 
substrate, may be from one to four nucleot ide ( s ) long. 

The present invention also provides a method of tagging a 
5' end of a capped messenger RNA comprising: a) isolating 
mRNA from cells or a tissue; b) removing an RNA cap 
structure, from the isolated mRNA, resulting in a de-capped 
RNA; c) dephosphorylating the de-capped RNA, thereby 
forming a de-capped and dephosphorylated RNA; d) 
constructing a DNA cleavage substrate for topoisomerase 
having a topoisomerase cleavage site and a complementary 
strand, the complementary strand having a mixed or random 
base composition downstream of the topoisomerase cleavage 
site, the DNA cleavage substrate being designated as a 
DNA- (N) substrate; e) cleaving the DNA- (N) substrate with 
a topoisomerase, thereby forming a covalent topoisomerase - 
. DNA- (N) complex containing a 5' tail of mixed or random 
base composition on a noncleaved strand; and f) incubating 
the cleaved covalent topoisomerase -DNA- (N) complex with 
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the de-capped and dephosphorylated RNA formed in step (c) 
to form a 5' DNA-tagged DNA-RNA ligation product. 

in an embodiment of the above-described method, the removal 
of the RNA cap structure is by either of enzymatic 
treatment of the mRNA with a -pyrophosphatase or chemical 
decapping by periodate oxidation and beta elimination. In 
a preferred embodiment the pyrophosphatase ' is tobacco acid 
pyrophosphatase. m another preferred embodiment the 
topoisomerase cleavage site is CCCTT. m yet . another 
preferred embodiment the DNA- (N) cleavage substrate has a 
biotin moiety upstream of the cleavage site and is 
designated BioDNA- (N) . m an embodiment the method further 
comprises affinity purification of the biotinylated 5- DNA- 
tagged DNA-RNA ligation product by a binding of the biotin 
iety to streptavidin prior to step (e) . 



mo 



The present invention also provides a 5' tagged DNA-RNA 
ligation product generated by the method of tagging a 5- 
end of a capped messenger RNA . In an embodiment the 5- 
tagged DNA-RNA ligation product further comprises a 5- end 
label. in a further embodiment of the 5- end tagged DNA- 
RNA ligation product, the label is "p. In another 
embodiment of the 5' end tagged DNA-RNA ligation product, 
the label is a biotin moiety. 

The present invention also provides a method of isolating 
and cloning a capped mRNA after subtraction of non-capped 
RNA comprising: a) isolating mRNA from cells or a tissue; 
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b) dephosphorylating the mRNA; c) incubating a cleaved 
topoisomerase-BioDNA- (N) complex with the dephosphorylated 
mRNA to form a 5' BioDNA- tagged DNA-RNA ligation product; 
d) removing the 5' BioDNA- tagged DNA-RNA ligation product 
and any unreacted cleaved topoisomerase -BioDNA- (N) complex 
by adsorption to streptavidin and recovering any 
nonadsorbed material, said material being enriched for RNA 
having a capped 5' end and being resistant to 
dephosphorylation in step (b) , thereby being unable to 
react with the cleaved topoisomerase-BioDNA- (N) complex; e) 
removing of the 5' end cap from the enriched RNA recovered 
from the nonadsorbed material in step (d) ; f ) 
dephosphorylating the de- capped RNA, thereby forming a de- 
capped and dephosphorylated RNA; g) incubating a cleaved 
topoisomerase-BioDNA- (N) complex with the de-capped and 
dephosphorylated RNA to form a 5 1 BioDNA- tagged DNA-RNA 
ligation product; h) affinity purifying the 5 • DNA-tagged 
DNA-RNA ligation product; and PCR amplification of the 
decapped and dephosphorylated RNA of the DNA-RNA ligation 
product using a sense primer corresponding to a scissile 
strand of the topoisomerase' cleavage substrate 5' of the 
site of cleavage and an antisense primer, said antisense 
primer being complementary to either a 3' poly (A) tail or 
to an internal RNA sequence. In a preferred embodiment of 
the above-described method, the affinity purification in 
step (h) is by a binding of the 5' BioDNA- tagged DNA-RNA 
ligation product to streptavidin. In another preferred 
embodiment the removal of the RNA cap structure is by 
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either of enzymatic treatment of the mRNA with a 

pyrophosphatase or chemical decapping by periodate 
oxidation and beta elimination. In yet another preferred 

embodiment the pyrophosphatase is tobacco acid 
5 pyrophosphatase. 

In an embodiment of the method of covalently joining a DNA 
strand to an RNA strand, the 5' single strand tail has a 
specifically designed sequence. 

Another aspect of the present invention provides a method 
10 of targeting ligation of an RNA strand of interest within 

a mixture of RNA strands which comprises the above- 
described method of covalently joining a DNA strand to an 
RNA strand. In an embodiment of the method of targeting 
ligation of an RNA strand of interest within a mixture of 
15 RNA strands which comprises the method of covalently 

joining a DNA strand to an RNA strand, the 5' single strand 
tail provides specificity of a covalently joined DNA - RNA 
ligation product. 

In another preferred embodiment there is provided a method 
20 of obtaining a full-length gene sequence comprising: (a) 

isolating full-length mRNA ; (b) attaching a DNA tag 
sequence to the isolated mRNA; and (c) synthesizing cDNA 
using the tagged mRNA as a template. 

To insure that only full-length mRNA is used in this aspect 
25 of the invention (thus insuring the generation of a full- 

length gene sequence.) it is generally preferred 1 that only 



BNSDOCID: <WO 9856943A1_IB> 



WO 98/56943 PCT/US98/12372 

-27- 

capped mRNA be isolated. Eukaryotic primary transcripts 
are modified at the initiating, or 5', nucleotide of the 
primary transcript by the addition of a 5 • methylated cap 
(Shatkin, Cell 9:645, 1976) which may serve to protect the 
mRNA from enzymatic degradation. Only full-lengch 

transcripts will be so modified. The cap structure may be 
modified, such as by adding an affinity purification tag 
such as biotin, chitin binding domain, and the like 
(Carnici, et al , supra).. The affinity tagged capped mRNA 
can then be isolated from degraded mRNA or RNAs with poly 
A tails that are not full-length coding mRNAs . 

The affinity tagged mRNA can be separated from untagged RNA 
using affinity purification, for example by contacting the 
tagged mRNA with an affinity purification material such as 
a solid support complexed with streptavidin, avidin, 
chitin, glutathione, and the like. Alternatively, 
unmodified capped mRNA can be separated from RNA species 
lacking a cap by contacting the capped mRNA with a solid 
support complexed to, for example, phenylboronic acid (see 
Theus and Liarakos, Biotechniques 9 (5) : 610-612 , 1990). 
Suitable solid supports include various column 
chromatography gels, such as sepharose, agarose, and the 
like, and magnetic beads. 

Any eukaryotic cell type can serve as a source for mRNA to 
be used in practicing the method of the invention including, 
both animal cells and plant cells. Suitable animal cells 
include mammalian cells (rodent, non-human primate, 
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primate, goat, sheep, cow, and the like) and insect cells 
(moth, Drosophila, and the like) . Methods of extracting 
mRNA from different cell types are well known in the art 
(see, for example, Ausubel, et al, supra). 

The isolated mRNA is preferably decapped and 
dephosphorylated after isolation. Methods of decapping 
RNAs are well known in the art and include both enzymatic 
methods (such as by using a pyrophosphatase such as tobac 
pyrophosphatase) and. chemical methods (such as periodat 
oxidation and beta elimination). Likewise methods for 
dephosphorylation of RNA are well known in the art, for 
example by using alkaline phosphatase. 

A DMA tag sequence can be attached to the isolated full- 
length mRNA using the methods described above. A preferred 
DNA tag sequence is shown in Figure 11 both as a double 
stranded DNA cleavage substrate and as a covalent 
topoisomerase-DNA intermediate. The complementary strand 
of the topoisomerase-DNA intermediate includes a 3 ■ 
overhang of from i to 4 nucleotides, which can be any 
mixture of adenine, guanine, cytosine or thymine, 
designated in the figure as N. These nucleotides will base 
pair with the first l to 4 bases of the 5- end of the 
isolated mRNA molecule, allowing the covalently attached 
topoisomerase to catalyze the transesterif ication reaction 
which joins the DNA tag to the end of the RNA sequence. 
The DNA tag sequence comprises a topoisomerase recognition 
site, preferably CCCTT, and in addition may comprise a 
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recognition site for a site-specific restriction 
endonuclease, such as EcoRl , useful for the subsequent 
insertion of a cDNA molecule into an expression vector. 

The DNA-RNA molecule is used as a template for synthesis 
5 and amplification of full-length cDNA sequences, preferably 

using the polymerase chain reaction (PCR) , a technique well 
known in the art (see Ausubel, et al, supra) . Suitable 
primers include all or a portion of the 5 1 tag sequence of 
the DNA-RNA molecule and a gene specific 3' primer or an 
10 oligo dT primer. 

The amplified gene products are next isolated from the 
other components of the amplification reaction mixture. 
This purification can be accomplished using a variety of 
methodologies such as column chromatography, gel 
15 electrophoresis, and the like. A preferred method of. 

purification utilizes low-melt agarose gel electrophoresis. 
The reaction mixture is separated and visualized by 
suitable means, such as ethidium bromide staining. DNA 
bands that represent correctly sized amplification products 
20 are cut away from the rest of the gel and placed into 

appropriate corresponding wells of a 96 -well microtiter 
plate. These plugs are subsequently melted and the DNA 
contained therein utilized as cloning inserts. 

The purified, amplified gene sequences are next inserted 
25 into an expression vector. A variety of expression vectors 

are suitable for use in the practice of the present 
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invention, both for prokaryotic expression and eukaryotic 
expression. In general, the expression vector will have 
one or more of the following features: a promoter-enhancer 
sequence, a selection marker sequence, an origin of 
replication, an affinity purification tag sequence, an 
inducible element sequence, an epitope- tag sequence, and 
the like. 

Promoter-enhancer sequences are DNA sequences to which RNA 
polymerase binds and initiates transcription. The promoter 
determines the polarity of the transcript by specifying 
which strand will be transcribed. Bacterial promoters 
consist of consensus sequences, -35 and -10 nucleotides 
relative to the transcriptional start, which are bound by 
a specific sigma factor and RNA polymerase. Eukaryotic 
promoters are more complex. Most promoters utilized in 
expression vectors are transcribed by RNA polymerase II. 
General transcription factors (GTFs) first bind specific 
sequences near the start and then recruit the binding of 
RNA polymerase II. In addition to these minimal promoter 
elements, small sequence elements are recognized 
specifically by modular DNA-binding/trans-act ivating 
proteins (eg. AP-1, SP-1) which regulate the activity of a 
given promoter. Viral promoters serve the same function as 
. bacterial or eukaryotic promoters and either provide a 
specific RNA polymerase in trans (bacteriophage T7) or 
recruit cellular factors and RNA polymerase (SV4 0, RSV, 
CMV) . Viral promoters are preferred as they are generally 



WO 98/56943 PCT/US98/1 2372 

-31- 

particularly strong promoters. 

Promoters may be, furthermore, either constitutive or, more 
preferably, regulatable (i.e., inducible or derepressible) . 
Inducible elements are DNA sequence elements which act in 

5 conjunction with promoters and bind either repressors (eg. 

lacO/LAC Iq repressor system in E. coli) or inducers (eg. 
gall/GAL4 inducer system in yeast) . In either case, 
transcription is virtually "shut off" until the promoter is 
derepressed or induced, at which point transcription is 

10 "turned-on" . 

Examples of constitutive promoters include the int promoter 
of bacteriophage A, the bia promoter of the p-lactamase gene 
sequence of pBR322, the CAT promoter of the chloramphenicol 
acetyl transferase gene sequence of pPR325, and the like. 
15 Examples of inducible prokaryotic promoters include the 

major right and left promoters of bacteriophage <P L and P ? ), 
the trp, reca, lacZ, Lad, AraC and gal promoters of E . 
coli, the a-amylase (Ulmanen Ett at., J. Bacterioi. 
162:176-182, 1985) and the sigma-2 8-speci f ic promoters of 
20 B. subtilis (Gilman et al., Gene sequence 32 : 11-20 ( 1984) ) , 

the promoters of the bacteriophages of Bacillus (Gryczan, 
In: The Molecular Biology of the Bacilli, Academic Press, 
Inc., NY (1982)), Streptomyces promoters (Ward et at., Mol . 
Gen. Genet. 203:468-478, 1986), and the like. Exemplary 
25 prokaryotic promoters are reviewed by Glick (J. Ind. 

Microtiot. 1:277-282, 1987); Cenatiempo (Biochimie 68:505- 
516, 1986); and Gottesman (Ann. Rev. Genet. 18:415-442, 
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1984) . 

Preferred eukaryotic promoters include, for example, the 
promoter of the mouse metallothionein I gene sequence 
(Hamer et al., J. Mol . Appl . Gen. 1:273-288, 1982); the 
5 TK promoter of Herpes virus (McKnight, Cell 31:355-365, 

1982); the SV40 early promoter (Benoist et al . , Nature 
(London) 290:304-310, 1981); the yeast gall gene sequence 
promoter (Johnston et al . , Proc . Natl. Acad. Sci . (USA) 
79:6971-6975, 1982); Silver et al . , Proc. Natl. Acad. 
10 Sci. (USA) 81:5951-5955, 1984), the CMV promoter, the EF-1 

promoter, Ecdysone-respons i ve promoter (s), and the like. 

Selection marker sequences are valuable elements in 
expression vectors as they provide a means to select, for 
growth, only those cells - which contain a vector. Such 

15 markers are of two types: drug resistance and auxotrophic. 

A drug resistance marker enables cells to detoxify an 
exogenously added drug that would otherwise kill the cell. 
Auxotrophic markers allow cells to synthesize an essential 
component (usually an amino acid) while grown in media 

20 which lacks that essential component. 

Common selectable marker gene sequences include those for 
resistance to antibiotics such as ampicillin, tetracycline, 
kannamycin, bleomycin, streptomycin, hygromycin, neomycin, 
Zeocin™, and the like. Selectable auxotrophic gene 
25 sequences include, for example, hisD, which allows growth 

in histidine free media in the presence of histidinol. 
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A preferred selectable marker sequence for use in yeast 
expression systems is URA3 . Laboratory yeast strains 
carrying mutations in the gene which encodes orotidine-5' - 
phosphate decarboxylase, an enzyme essential for uracil 
5 biosynthesis, are unable to grow in the absence of 

exogenous uracil. A copy of the wild-type gene (ura4+ in 
S. pombe and URA3 in S. cerevisiae) will complement this 
defect in trans. 

A further element useful in an expression vector is an 
10 origin of replication sequence. Replication origins are 

unique DNA segments that contain multiple short repeated 
sequences that are recognized by multirneric origin-binding 
proteins and which play a key role in assembling DNA 
replication enzymes at- the origin site. Suitable origins 
15 of replication for use in expression vectors employed 

herein include E. coli oriC, 2Ai and ARS (both useful in 
yeast systems), sfl, SV40 (useful in mammalian systems), 
and the like. 

Additional elements that can be included in an expression 
20 vector employed in accordance with the present invention 

are sequences encoding affinity purification tags or 
epitope tags. Affinity purification tags are generally 
peptide sequences that can interact with a binding partner 
immobilized on a solid support. Synthetic DNA sequences 
25 encoding multiple consecutive single amino acids, such as 

histidine, when fused to the expressed protein, may be used 
for one-step purification of the recombinant protein by 
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nigh , fflnity binding to a resin coiumn _ such ^ ^ 

sepharose. *n endopeptidase recognition sequence can fce 
engineered between the pol K „ in0 acid tag and ^ 
■ of interest to all™ subsequent re. OT al o f the leader 
Peptide by -. digestion wUh Enterofcinase _ ^ othfir 
- Proteases. Sequences encoding peptides such as th.'chitin 
biding domain (which blnds to glutachlQne _ s _ 
transferase (which binds to glutathione,, biotin (which 
binds to av idln and strepavidin, , and the liKe can also be 
"-d for facilitate purification of the protein of 
interest. The affinity purification tag can be separated 
from the protein of interest by methods well known in th . 
art. including the use of inteins (protein self-splicing 
elements. Chong, et al, Gene 192:271-281, 1997,. 

Epitope tags are short peptide sequences that 
recognized by epitope specific antibodies. A fusion 
Protein comprising a recombinant protein and an epitope tag 
can be simply and easily purified using an antibody bound 
to a chromatography resrn. The presence of the epitope tag 
furthermore allows the recombinant protein to be detected 
in subsequent assays, such as Western blots, without having 
to produce an antibody specific for the recombinant protein 
itself. Examples ot commonly used ep . tope ^ 

glutathiones-transferase iczt\ k 

erase <GST), hemaglutinin (HA), the 

peptide Phe-His-His-Thr-Thr rhiHn k,- 

"i- -nr, chitm binding domain, and the 

like. 

A further useful elempnt- i n _ 

element in an expression vector i 



s a 
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multiple cloning site or polylinker. Synthetic DNA 
encoding a series of restriction endonuclease recognition 
sites is inserted into a plasmid vector downstream of the 
promoter element. These sites are engineered for 

convenient cloning of DNA into the vector at a specific 
position. 

The foregoing elements can be combined to produce 
expression vectors useful in creating the libraries of the 
invention. Suitable prokaryotic vectors include plasmids 
such as those capable of. replication in E. coil (fcr 
example, pBR322, ColEi, pSClOl, PACYC 134, itVX, pRSET , 
pBAD (Invitrogen, Carlsbad, CA) and the like). Such 
plasmids are disclosed by Sambrook (cf. "Molecular 
Cloning: A Laboratory Manual", second edition, edited by 
Sambrook, Fritsch, & Maniatis, Cold Spring Harbor 
Laboratory, (1989)). . Bacillus plasmids include pCl94, 
pC221, pT127, and the like, and are disclosed by Gryczan 
(In: The Molecular Biology of the Bacilli, Academic Press, 
NY (1982), pp. 307-329). Suitable Streptomyces plasmids 
include plJlOl (Kendall et al . , J. Bacteriol . 169:4177- 
4183, 1987) ,. and streptomyces bacteriophages such as c|>C31 
(Chater et al . , In: Sixth International Symposium on 
Actinomycetales Biology, Akademiai Kaido, Budapest, Hungary 
(1986), pp. 45-54). Pseudomonas plasmids are reviewed by 
John et al. (Rev. Infect. Dis. 8:693-704, 1986), and. 
Izaki (Jpn. J. Bacteriol. 33:729-742, 1978). 

Suitable eukaryotic plasmids include, for example, BPV, 
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vaccinia, SV40 o m< 

pY ES2/r . CirCl6 ' P C DNA3 . 1 , PCDNA3 . 1/GS 

their derivatives. Such plasmids are well 

knOWn ^ the art (Botstein et al m- 

al " Miami Wntr cu™~ 

Veast s h MOlSCUlar B1o1 °» - th. 

s SaCCh "°— C - le and Inheritance „, Cold 

Spnng Harbor Laboratory, cold 

470 i 981 . n . P Harb ° r ' Ny < P- 445- 

0. 1981, Broach. Cell 28;203 . 204 , 19M; 

"in. Hepatol. Oncol. i 0: 39- 48 1980 . M . 

B(„i ' Mani atis, In: Ce i 1 

B«lo B y: A Comprehensive Treati C ^ 

expression, Academic Press Ny 

l-ress, Nl, pp _ 563-608, 1980. 

Once plas.ids containing ^ ^ ^ / 

C ° rreCt Nation have been identified Srt . - thS 

Prepared for use in ,-h „ Pernio DNA i s 

6 transf -^tion of host cells f or 
expression. Methods ' ceils for 

tra , 1 ° f P^Paring plasmid DNA and 

transformation of cells »v , 

the art s „ ^ th ° SS in 

art. such methods are describe , 

Profcaryotic hosts are, generally, vefy f 
convenient for rh lcient and 

are, ther f """"^ " — and 

— o ,; oi fre r tly are ~ » — 

Recognized prokaryotic host-* • , 

con and those Jm * « 

from genera s uch as Baciu ^ 
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Pseudomonas, Salmonella, Serratia, and the like. However, 
under such conditions, the polypeptide, will not be 
glycosylated. The prokaryotic host selected for use herein 
must be compatible with the replicon and control sequences 
5 in the expression plasmid. 

Suitable hosts may often include eukaryotic cells. 
Preferred eukaryotic hosts include, for example, yeast, 
fungi, insect cells, and mammalian cells either in vivo, or 
in tissue culture. Mammalian cells which may be useful as 

10 hosts include HeLa cells, cells of fibroblast origin such 

as VERO, 3T3 or CHOK1, HEK 293 cells or cells of lymphoid 
origin (such as 32D cells) and their derivatives. 
Preferred mammalian host cells include nonadherent cells 
such as CHO, 32D, and the like. Preferred yeast host cells 

15 include S. pombe, Pichia pastoris, S. cerevisiae (such as 

INVScl), and the like. 

In addition, plant cells are also available as hosts, and 
control sequences compatible with plant cells are 
available, such as the cauliflower mosaic virus 35S and 

20 19S, nopaline synthase promoter and polyadenylat ion signal 

sequences, and the like. Another preferred host is an 
insect cell, for example the Drosophila larvae. Using 
insect cells as hosts, the Drosophila alcohol dehydrogenase 
promoter can be used. Rubin, Science 240:1453-1459, 1988). 

25 Alternatively, baculovirus vectors can be engineered to 

express large amounts of peptide encoded by a desire gene 
sequence in insects cells (Jasny, Science 238:1653, 1987); 
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Miller et al., i n: Gen etic Engineering (1986), setlow, 
J-K- ec al., eds./ Plenum, Vol. 8, pp. 277-297) The 
Present invention- also features the purified,- isolated or 
enrxched versions of the expressed gene products produced 
by the methods described above. 

This invention will be better understood from the 
Experimental Details which follow. However, one skilled n 
the art will readily appreciate that the S peciric methods 
and results discussed are merely illustrative of fn e 
invention as described more fully in the claims whlch 
follow thereafter. 

Experimental Details 

METHODS AND MATERIALS 

Preparation of 



15 Oligonucleotide 



Ibidem RNA^-DNA __ and DNA-p -Rj^ 



25 



CCCTT-containing 36-mer oligonucleotides containing a 
single internal » P - label at the scissile pnosphate ^ 
Prepared by ligating two 18-mer strands (synthetic RNA or 
DNA olxgonucleotides) that had been hybridized to a 
complementary 36-mer DNA strand. The sequence of the 
proximal CCCTT-containing 18-mer strand was 5 ,_ 
CATATCCGTGTCGCCCTT as DNA or 5< -CAUAUCCGUGUCCCUU as RNA 
The sequence of the dxstal 18-mer strand was 5< - 
ATTCCGATAGTGACTACA as DNA or 5' -AUUCCGAUAGUGA.CUACA as RNA 
The distal 18-mer strand was 5' -labeled in the presence of 
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[Y 32 P] ATP and T4 polynucleotide kinase., then gel-purified. 
The sequence of the 36-raer strand was 5' 
TGTAGTCACTATCGGAATAAGGGCGACACGGATATG. The strands were 
annealed in 0.2 M NaCl by heating at 65°C for 2 min, 
followed by slow-cooling to room temperature . The molar 
ratio of the 5' -labeled distal 18-mer to the proximal 18- 
mer and the 36-mer strand in the hybridization mixture was 
1:4:4. The singly nicked product of the annealing reaction 
was sealed in vitro with purified recombinant vaccinia 
virus DNA ligase (14, 15) . The ligation reaction mixtures 
(400 uD contained 50 mM Tris HC1 (pH 8.0),, 5 mM DTT 10 mM 
MnCl 2 , 1 mM ATP, 10 pmol of 5' 3: p-labeled nicked substrate, 
and 160 pmol of ligase. After incubation for 4 h at 22°C, 
the reactions were halted by the addition of EDTA to a 
final concentration of 25 mM. The samples were extracted 
with phenol-chloroform and the labeled nucleic acid was 
recovered from the aqueous phase by ethanol precipitation. 
The 36-mer duplex products were dissolved in TE buffer (1'J 
mM tris HC1, pH 8.0, 1 mM EDTA). Ligation of the labeled 
18-mer distal strand to the unlabeled CCCTT-containing 18- 
mer strand to form an internally labeled 36-mer product was 
confirmed by electrophoresis of the reaction products 
through a 17% denaturing polyacrylamide gel. The extents 
of ligation [ 36-mer/ ( 36-mer + 18-mer)] were as follows: 
DNA- p- DNA (88%) ; DNA-p-RNA (67%); RNA-p-DNA (66%). 

Covalent Binding of Topoisomerase to Internally Labeled 36- 
mer duolexes. 
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Recombinant vaccinia. topoisomerase was expressed xn 
bacteria and purified via phosphocellulose and SP5PW column 
chromatography as. described (16, 17). Reaction" mixtures 
for assay of covalent adduct formation contained ( per ,0 
PD 50 mM Tris-HCl (pH 8.0), 0.2 pmol of 36-mer duplex, and 
1 Pmol of topoisomerase. The reactions were initiated bv 
add lng topoxsomerase and halted by adding SDS to 1* f inal 
concentration. the samples were analyzed by SDS-PAGE. 
.Covalent complex formation was revealed by the transfer of 
radiolabeled polynucleotide to the topoisomerase 
Polypeptide (3). The extent Qf adduct formation ^ 
quantitated by scanning the gel using a FUJIX BAS1000 
Phosphorimager and was expressed as the percent of the 
input 5' -p-labeled 36-mer substrate that was covalently 
transferred to protein. 

DNA_Strand Transfer to an_RN A Acceptor . 

An 18-mer CCCTT-containing DNA oligonucleotide (5' - 
CGTGTCGCCCTTATTCCC ) was 5' end-labeled in the presence of 
fY -P] ATP and T4 polynucleotide kinase, then gel-purified 
and hybridized to a complementary 30-mer strand to form the 
18-mer/30-mer suicide cleavage substrate. Covalent 
toooisomerase-DNA complexes were formed in .. a reaction 
mixture containing (per 20 pi) 50 mM Tris-HCl ( P H 8.0), 0 5 
pmol of 18-mer/30-mer DNA, and 2.5 pmol of topoisomerase. 
The mixture was incubated for 5 nun at 37'c, The strand 
transfer reaction was initiated by adding an 18-mer 
acceptor strand 5' -ATTCCGATAGTGACTACA (either DNA or RNA) 
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to a concentration of 25 pmol/20 pi (i.e., a 50-fold molar 
excess over the input DNA substrate), while simultaneously 
adjusting the reaction mixtures to 0.3 M NaCl . The 
reactions were halted by addition of SDS and formamide to 
0.2% - and 50%, respectively. The samples were heat- 
denatured and then electrophoresed through a 17* 
polyacrylamide containing 7 M urea in TBE (90 mM Tris- 
borate, 2.5 mM EDTA) . The extent of strand transfer 
(expressed as the percent of input labeled DNA converted to 
a 30-mer strand transfer product) was quantitated by- 
scanning the wet gel with a phosphor imager . 

Preparation of 32 p-labeled 36-mer RNA . 

A 36-nucleotide run-off transcript was synthesized in vitro 
by T3 RNA polymerase from a pBluescript II-SK(-) plasmid 
template that had been linearized by digestion with 
endonuclease EagI . A transcription reaction mixture (100 
Ul) containing AO mM Tris HC1 (pH 8.0), 6 mM MgCl-, 2 mM 
spermidine, 10 mM NaCl, 10 mM DTT, 0.5 mM ATP, 0 . 5 mM CTP, 
0.5 mM UTP, 6.25 pM [a 3 ^Pj GTP, 5 yg of template DNA, and 
100 units of T3 RNA polymerase (Promega) was incubated for 
90 min at 37°C. The reaction was halted by adjusting the 
mixture to 0.1% SDS, 10 mM EDTA, and 0.5 M ammonium 
acetate. The samples were extracted with phenol-chloroform 
and ethanol-precipitated. The pellet was resuspended in 
formamide and electrophoresed. through a 12% polyacrylamide 
gel containing 7M urea in TBE. The radiolabeled 36-mer RNA 
was localized by autoradiography of the wet gel and eluted 
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from an excised gel slice by soaking for 16 h at 4°C in 0.4 
ml of buffer containing 1 M ammonium acetate, 0.2% SDS, and 
20 mM EDTA. The eluate was phenol-extracted and ethanol- 
precipitated. The RNA was resuspended in TE . 

Dephosphorylation of the RNA 5' terminus was carried out in 
a reaction mixture (30 pi) containing 10 mM Tris HC1 (pH 
7.9), 50 mM NaCl, 10 mM MgCl 2 , 1 mM DTT, 10 pmol of 36-mer 
RNA, and 30 units of calf intestine alkaline phosphatase 
(New England Biolabs) . After a 1 h incubation at 37°C, the 
mixture was phenol-extracted and ethanol-precipi tated . The 
phosphatase- treated 36-mer transcript was repurified 
electrophoret ically as described above. 

Affinity Tagging of RNA Using Vaccinia Topoisomerase 

The strand transfer reaction pathway is diagrammed in 
15 Figure 10a. The biotinyiated DNA Substrate which contains 

a single topoisomerase recognition site is immobilized on 
the Dynabeads (Dynal) streptavidin solid support. The 
biotin moiety (indicated by the black square) is introduced 
at the 5"' end of the CCCTT-containing strand via standard 
20 protocols for automated oligonucleotide synthesis. The 

purified vaccinia topoisomerase is reacted with the bead- 
bound DNA to form a covalent enzyme-DNA donor complex, as 
illustrated. Enzyme not bound to DNA is removed by washing 
the beads with buffer. The strand transfer reaction is 
25 initiated by addition of the [ 32 P]-CMP labeled T7 transcript 

which is dephosphorylated by prior treatment with alkaline 
phosphatase. The 5' single-strand tail of the donor 
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complex is complementary to the 12 nucleotides at the 5' 
end of the T7 transcript. Religation of the covalently 
held biotinylated DNA strand to the T7 transcript is 
observed as conversion of the 30-mer RNA to a product of .50 
nucleotides. 

Experimental Details : The DNA substrate was formed by 
annealing the biotinylated 25-mer strand containing the 
topoisomerase recognition site to a complementary 5' 
phosphorylated 24-mer strand (present at a 4-fold molar 
excess). The strands were annealed in the presence of 0.2 
M NaCl by heating at 65°C for 10 min, followed by slow 
cooling to room temperature. The biotinylated duplex was 
immobilized on streptavidin beads by incubating 10 pmol of 
the DNA with 10 ug of Dynabeads in 50 mM Tris-HCl (pH 8.0), 
1 M NaCl for 10 min at 22°C. The beads were recovered by 
centrifugation . The beads were rinsed twice with i ml of 
50 mM Tris-HCl (pH 8.0) . The washed beads were' resuspended 
in 20 ui of 50 mM Tris-HCl (pH 8.0). A 5-fold molar excess 
of topoisomerase (50 pmol) was added to the bead-linked DNA 
substrate. The mixture was incubated at 37°C for min. The 
beads were recovered by centrifugation, rinsed twice with 
1 ml of 50 mM Tris-HCl, then resuspended in 18 yl of 50 mM 
Tris-HCl, 0.3 M NaCl. Strand transfer was initiated by 
addition of 1 pmol of [ ?: P]-CMP labeled T7 transcript. The 
mixture was incubated at 37°C for 15 min. The beads were 
then recovered by centrifugation, washed, and resuspended 
in 20 pi of buffer containing 0.8% SDS and 80% formamide. 
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The samples were heated at 95°C for .5 min, centrifuged for 
5 min, then the supernatants were electrophoresed . through 
a 12% polyacrylamide gel containing 7M urea in TBE buffer. 
An autoradiograph of the gel is shown in Figure 10B. Lane 
5 B (Bound) --.product of the strand transfer reaction bound 

to the Dynabeads; lane F (Free) - supernatant from the 
strand transfer reaction. The positions of the input 30- 
mer T7 transcript and the 50-mer product are shown at the 
right. 

10 RNA substrate : The 30-nucleot ide runoff transcript was 

synthesized in vitro by T7 RNA polymerase from a 
pBluescript II-SK(-) plasmid template that had bee- 
linearized by digestion with endonuclease Xhol . The 
transcript was labeled with [a 32 P]-CTP under similar 

15 reaction conditions as described for preparation of the T3 

RNA transcript. The 30-mer RNA was gel-purified and 
subsequently dephosphorylated as described. 

RESULTS 

Covalent Binding of Topoisomerase to a Duplex Substrate . 
20 Containing RNA 3 ' of the Scissile Phosphate . 

Vaccinia topoisomerase does not bind covalently to CCCTT- 
contairxing RNA duplexes; nor does it form a covalent 
complex on RNA-DNA hybrid duplexes in which one of the two 
strands is RNA (9). Control experiments showed that the 
25 failure to form a covalent adduct on a CCCUU-containing RNA 

strand was not caused by uracil substitution for the 
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thymine bases in the CCCTT sequence (9) . To better 
understand why vaccinia topoisomerase does not form a 
covalent complex with all-RNA strands, we prepared 36-bp 
duplex substrates in which the scissile strand was a tandem 
RNA-DNA or DNA-RNA copolymer and the noncleaved strand was 
all-DNA (Fig. 1) . These duplexes were uniquely labeled 
with 32 P at the scissile phosphodiester . The substrate 
molecules were constructed by annealing two 18-mer 
oligonucleotides (one of which had been 5' ^'P-labeled) to 
a complementary 36-mer DNA strand to form a singly nicked 
duplex. The 5' -labeled 18-mer strand was then joined to 
the unlabeled CCCTT-strand (or CCCUU strand) in a reaction 
catalyzed by vaccinia virus DNA ligase. The 36-mer duplex 
products were isolated and then used as substrates for 
vaccinia DNA topoisomerase. We will refer to these 
substrates as DNA- p- DNA, DNA-p-RNA, and RNA-p-DNA, with the 
labeled phosphate being denoted by p. 

Transesterif ication by topoisomerase at the CCCTT site will 
result in covalent binding of a 3' 32 P-labeled 18-mer 
oligonucleotide to the enzyme. The extent of covalent 
complex formation on the DNA-p-RNA substrate in 10 min was 
proportional to input topoisomerase; 80-85% of the 36-mer 
strand was transferred to the topoisomerase at saturating 
enzyme (Fig. 1) . The same level of topoisomerase 

covalently bound less than X% of the RNA-p-DNA 3 6-mer 
strand. Hence, the topoisomerase tolerated RNA 

substitution downstream of the scissile phosphate, but was 
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impeded from forming the covalent adduct when the CCCTT 
sequence was in RNA form. 

The kinetics of the ' covalent binding reaction at a 
saturating level of topoisomerase were assessed (Fig. 2;. 
5 An all-DNA 36-mer (DNA-p-DNA) was bound to an endpoint cf 

21% in 2 min (Fig. 2A) . The apparent, cleavage-religat ion 
equilibrium constant (K c: - covalent complex/noncovalent 
complex) was 0.26, which agrees with values of 0.2 to 0.25 
reported previously for equilibrium cleavage of a 5' end- 

10 labeled CCCTT-containing DNA substrate (10, 11) . The DNA- 

p-RNA 36-mer was bound covalently to an endpoint of 80% in 
5 min (Fig. 2A, and other data not shown) . The apparent 
equilibrium constant for DNA- p- RNA (K C1 - 4) was 
significantly higher than that observed for the all-DNA 

15 ligand. 

The RNA- p- DNA 36-mer was transferred to the topoisomerase, 
albeit very slowly. After 4 h, 43 of the CCCUU-containing 
RNA strand was bound covalently to the enzyme (Fig. 2B) . 
An endpoint was not established in this experiment. 

20 However, by comparing the initial rate of covalent adduct 

formation on RNA- p- DNA (0.04% of input substrate cleaved 
per min) to the amount adduct formed on DNA-p-DNA at the 
earliest timepoint (12* in 10 sec), it is estimated that 
RNA substitution of the CCCTT-por tion of the substrate 

25 slowed the rate of covalent complex formation by about 

three orders of magnitude. 
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DNA Strand Transfer to an RNA Acceptor . 

Rejoining of the cleaved strand occurs by attack of a 5' 
hydroxyl terminated polynucleotide on the 3' phosphodiester 
bond between Tyr-274 and the CCCTT site. This 
5 transesterif ication step can be studied independent of 

strand cleavage by assaying the ability of a performed 
topoisomerase-DNA complex to religate the covalently held 
strand to a heterologous acceptor strand (5, 11). To form 
the covalent topoisomerase-DNA donor complex, the enzyme 
10 was initially incubated with a suicide substrate consisting 

of a 5' 3i P-labeled 18-mer scissile ' strand 
(CGTGTCGCCCTTATTCCC) hybridized to a 30-mer strand. 
Cleavage of this DNA by topoisomerase is accompanied by- 
dissociation of the 6-nucleotide leaving group, ATTCC . 
15 With no readily available acceptor for religation, the 

enzyme is essentially trapped on the DNA as a suicide 
intermediate (Fig. 3). In a 5 min reaction in enzyme 
excess, >90% of the 5' -P-labeled strand becomes covalently 
bound to protein. The strand transfer reaction was. 
20 initiated by adding a 50-fold molar excess of an 18-mer 

acceptor strand (either DNA or RNA) complementary to the 5' 
single-strand tail of the covalent donor complex (Fig. 3), 
while simultaneously increasing the ionic strength to 0.3 
M NaCl. Addition of NaCl during the religation phase 
25 promotes dissociation of the topoisomerase after strand 

closure and prevents recleavage of the strand transfer 
product. Ligation of the covalently held 12-mer 
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CGTGTCGCCCTT to the 18-rner yields a 32 P-labeled 30 mer (Fig. 
4, lane 1) . The suicide intermediate transferred 94s of 
the input CCCTT-containing strand to the 18-mer DNA strand 
(Fig. 3). The extent of religation at the earliest time 
5 point (5 sec) was 90% of the endpoint value. From this 

datum a religation rate constant (k roJ ) of >0.5 sec was 
calculated. A k rel value of ~1.3 sec*" ; had been determined 

.previously ( from experimental values for k c: and £ at 

;37°C) (18) . 

10 Topoisomerase readily ligated the covalently held 12-mer 

DNA to an 18-mer RNA acceptor to form a 30-mer produce 
(Fig. 4, lane 5) . 89% of the input CCCTT-strand was 
transferred to RNA, with 40% of the endpoint value attained 
in 5 sec. This datum was used to estimate a rate constant 

15 of 0.1 sec" 1 for single- turnover strand transfer to RNA. 

Thus, religation to DNA was about 10 Limes faster than 
religation to RNA. The slowed rate of RNA religation is 
likely to account for the observed increase in the 
cleavage-religation equilibrium constant = k ci /k cei ) on 

20 the DNA- p- RNA 3 6-mer. 

Analysis of the Strand Transfer Reaction Product 

The predicted product of strand transfer to RNA is a 30-mer 
tandem DNA- RNA strand (5' - CGTGTCGCCCTT^ UUCCGAUAGUGACUACA) 
uniquely 32 P-labeled at the 5' end. The structure of this 
25 molecule was confirmed by analysis of the susceptibility of 

this product to treatment with NaOH. The labeled 30-mer 
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RNA ligation product was converted nearly quantitatively 
into a discrete species that migrated more rapidly than the 
input 18-mer CCCTT-containing DNA strand (Fig. 4 lane 6) . 
The mobility of this product was consistent with a chain 
length of 13 nucleotides. The expected 3i P-labeled alkaline 
hydrolysis product of the RNA strand transfer product is a 
13-mer (5' -CGTGTCGCCCTTAp) . . Control reactions showed that 
neither the 3; P-labeled 18-mer scissile strand of the 
suicide substrate nor the 30-mer product of strand transfer 
to DNA was susceptible to alkali (Fig. 4, lanes 4 and 2) . 
It is concluded that topoisomerase can be used to iigate 
RNA to DNA in vitro. 

DNA ligand Tagging of an RNA Transcript Sy nthesized In 
Vitro by T3 RNA Polymerase . 

Practical applications of topoisomerase-mediated strand 
transfer to RNA include the V tagging of RNA transcripts. 
Bacteriophage RNA polymerases have been used widely to 
synthesize RNA polymerases have beer, used widely to 
synthesize RNA in vitro from plasmid DNA templates 
containing phase promoters. To test whether such 

transcripts were substrates for topoisomerase-catalyzed 
ligation, we constructed a CCCTT-containing suicide 
cleavage substrate that, when cleaved by topoisomerase, 
would contain a 5' single-strand tail complementary to the 
predicted 5' sequence of any RNA transcribed by T3 RNA 
polymerase from a pBluescript vector (Fig. 5). A 36- 
nucleotide T3 transcript was synthesized in a transcription 
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reaction containing ,^ P| GTP . The RNA was treated with 
.alkaline phosphatase to dephosphorylate the 5- termina- 
te topoisomerase-DNA covalent intermediate was forced on 
an unlabeled suicide substrate. Incubation of the 

radiolabeled T3 transcript with the suicide intermedia- 
resulted in the converse of the 36-mer RNA into a nove> 
specxes that migrated more slowly during polyacrylam.de oe^ 
-electrophoresis ,not shown,. The apparent size of th-s 
product U3 nucleotides, was indicative of ligation to t~ 
12-mer CCCTT DMA strand. The Nineties of DNA ligatio.-, to 
the T3 transcript are shown in Fig. 5. The reaction was 
virtually complete within 1 min; at its endpoint 29% of the 
input RNA had been Joined to DNA. No DN a-rna ligation 
product was formed in reaction containing a T3 transcript 

that had not been treated with 

cea Wlth alkaime phosphatase (noc 

shown) . 

The acceptor polynucleotides used in the experiment 
described above were capable of hybridizing perfectly with 
the 5- single-strand tali of the topoisomerase-DNA dono- 
=om P lex. It had been shown previously that the vaccinia 
virus topoisomerase is capable of joining the CCCTT-strard 
to an acceptor oligonucleotide that hybridizes so as to 
leave a singie nucleotide gap between the covalently boun , 
^nor 3- end and the 5 . terminus of the accepto- 
Religation across this gap generated a ! base deletion 
the product compared to the input scissile strand ,5, T. e 
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enzyme also catalyzes strand transfer to an acceptor 
oligonucleotide that, when hybridized, introduces an extra 
nucleotide between the donor 3' end and the penultimate 
base-paired nucleotide of the acceptor. Religation in this 
5 case will produce a 1 . base insertion (5). Deletion and 

insertion formation in vitro have also been documented for 
mammalian type I topoisomerase (19). However, there has 
been no report of the effects of acceptor strand gaps and 
insertions on the rate of strand joining by these enzymes. 

10 The kinetics of strand transfer by the vaccinia 

topoisomerase covalent intermediate to acceptor 
oligonucleotides that base-pair to the donor complex to 
form either a fully base-paired 3' duplex segment, or 3' 
duplexes with a 1-nucleotide gap, or a 2-nucleotide gap, 

15 were assessed. 84% of the input DNA substrate was ligated 

to the fully-paired acceptor in 10 sec, the earliest time 
analyzed (Fig. 6A) . The size of the strand transfer 
product was 30 nucleotides, as expected (Fig. 7, lane 3; . 
No 30-mer product was formed in the absence of the added 

20 acceptor strand (Fig. 7, lane 2). 

Religation across a 1-nucleotide gap was highly efficient, 
albeit slow. 85% of the input substrate was joined across 
a 1-nucleotide gap to yield the expected 29-nucleotide 
product (Fig. 6A and Fig. 7, lane 4). The kinetic data in 
25 Fig. 6 fit well to a single exponential with an apparent 

rate constant of 0.005 sec"'. Thus, single-turnover strand 
closure by topoisomerase across a 1-nucleotide gap was two 
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orders of mag nitude slower ^ ^ ^ ^ ^ 
a fully paired nick . yaccinia tQpoisomerase cataiv ^ rf 
strand transfer actoss a 2 . nucleotide ^ ^ 
antici pat ed 28 -nucleotide product (Fig . 7 , lane =) _ ^ 

thlS reaGti ° n fMble «• • linear accu-uUtion c- 

the 2-nucleotide gap product was observed over a , s 

been Joined. It was estimated ^ ^ ^ ^ 

-that relation across the 2-nucleotide gap was two orders 

of magnitude slower th*n -^i^ 

wer than joining across a 1-^nucleotide gao 
(and hence four ordure; ^-f ^ • ^ , 

orders or magnitude slower than the rate of 

joining across a nick) . 

Similar experiments were performed using DNA acceotors tha- 
contained either ! or 2 extra nucleotides at their 5' ends 
<Fx*. 6C). Rel igation to these acceptors yieided ^^^^ 

strand transfer products of 31 and 1? 

Ji and 3z .nucleotides, 
respectively (Fig . 7 , lanes 6 and 7) ^ o£ ^ ^ ^ 

"as relisted to for m tne l- nuc i eotlde lnsertion 

A rate constant of 0.0< sec- for relation 
«th 1-nucleotide insertion was c a lcul at ed. A ' simila*- 
endpornt was achieved in the for mat ion of a 2-nucleotide 

insertion product, but the srranH *- 

tne strand transfer rate w== 

considerably slower «r ig . 6C) . The observed rate con.fr" 
for 2-nucleotide xnsertion was 0.0001 sec-, i.e., thre . 
orders of magnitude !ower, than *.„ at a nlck . 
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Strand transfer by topoisomerase to a set of 18-mer 
acceptors that were capable of base-pairing with the 5' 
tail of the donor complex from positions -2 to -18 
(relative to the scissile +1 T : A base pair of the CCCTT 
element), but which have a base-mismatch at the -1 position 
immediately 3V of the scissile bond, was examined. The 
control acceptor, which has a normal -1 A:T base-pair, 
reacted to completion in 10 sec; 89% of the endpoint was 
achieved in 5 sec (Fig. 8). DNAs containing T:T, C:T, or 
G:T mispairs at the -1 position supported the same extent 
of strand transfer; 77% of the endpoint was attained in 5 
sec in each case (Fig. 8). Thus, within the limits of 
detection of this experiment, mismatch at the -1 position 
had little effect on the strand transfer reaction. There 
are clear and instructive differences between the effects 
of base mismatches versus a single nucleotide deletion on 
the rate of the strand joining step. 

Kinetics of Intramolecular Hairpin Formation . 

In the absence of an exogenous acceptor oligonucleotide, 
the 5' -OH terminus of the nonscissile strand of the 12- 
mer/30-mer covalent complex can flip back and act as the 
nucleophile in attacking the DNA- ( 3-phosphotyrosyl ) bond 
(5) . The reaction product is a hairpin molecule containing 
a 12-bp stem and an 18-nucleotide loop. The kinetics of 
this reaction were examined under single turnover 
conditions. In the experiment shown in Fig. 9A, 65% of the 
input CCCTT. strand was converted to hairpin product in 3 h. 
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12-bp stem and a 6-nucleotide loop. 69% of the input CCCTT 
strand was converted to hairpin product in 10 h. The 
observed rate constant was 8.2 x 10"-' sec" 1 . Thus, the 18- 
nucleotide 5' tail was -7 times more effective than the 5- 
mer 5' tail as the attacking nucleophile for strand 
transfer in cis. ■ Note that hairpin formation by these 
covalent complexes occurs without any 'potential for base- 
pairing by the single-strand tails. 

In order to examine the contribution of base-pairing to the 
rate of religation, the 5' terminal and penultimate bases 
of bottom strand of the 18-mer/30-mer substrate to 5' -AT 
(Fig. 9B) were altered. Now, the 5' -terminal three bases 
of the bottom strand (.5' -ATT) are identical to the 5' - 
terminal bases of the leaving strand (5' -ATTCCC) ; hence, 
the single-strand tail is self-complementary and capable of 
forming three base-pairs adjacent to the scissile 
phosphate. Intramolecular hairpin formation on this DNA 
was extremely fast; the reaction was complete in 10-20 sec 
(Fig. 9B) . The observed religation rate constant was 0.2 
sec" 1 . By comparing this value to the religation race 
constant on the non-complementary 18-mer/30-mer substrate 
(Fig. 9A) , it was surmised that 3 base-pairs accelerated 
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the reaction -350-fold. 

Kinetics of Single-Turnover Cleavage of a CCCTT-contain i n g 
Hairpin Molecule 

The 42-nucleotide 5' .^P-labeled hairpin product was gel- 
purified and tested as a substrate for covalent adduct 
formation by the vaccinia topoisomerase . 55%. of the input 
radioactivity was transferred to the topoisomerase 
polypeptide in 15 sec at 37°C; an endpoint of 90% transfer 
was attained in 60 sec (data not shown) . The apparent race 
constant for cleavage of the hairpin was 0.06 sec"'. Thus, 
the topoisomerase rapidly and efficiently cleaved a CCCTT- 
containing molecule in which there were no standard paired 
bases downstream of the scissile phosphate. The hairpin 
cleavage rate constant is about one- fifth of /c ci on the 18- 
mer/30-mer suicide substrate, which contains five paired 
bases of duplex DNA 3' of the CCCTT site. 

DISCUSSION 

Vaccinia topoisomerase catalyzes a diverse repertoire of 
strand transfer reactions. Religation of the covalently 
bound DNA to a perfectly base-paired acceptor DNA 
oligonucleotide provides a model for the strand closure 
step of the DNA relaxation reaction. Here, the kinetics of 
strand transfer to alternative nucleic acid acceptors are 
analyzed. The findings provide new insights into the 
parameters that affect transesterif ication rate, illuminate 
the potential for topoisomerase to generate mutations in 
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vivo, and suggest practical applications of vaccinia 
topoisomerase as an RMA modifying enzyme. 

Sugar Specificity for Covalent Ad duct Formation Resides 
within the CCCTT Element . 

Vaccinia topoisomerase is apparently incapable of binding 
covalently to CCCUU-containing RNA strands . This is the 
case whether the CCCUU strand is part of an RNA- RNA or an 
RNA-DNA duplex (9) . it has now been shown that the sugar 
specificity of the enzyme is attributable to a stringent 
requirement for DMA on the ~ S> side of the scissile 
phosphate, i.e., the CCCTT site must be DNA. Moreover, the 
CCCTT element must be a DNA- DNA duplex, because earlier 
experiments showed that a CCCTT strand is not cleaved when 
annealed to a complementary RNA strand (.9) . The RNA-DNA 
hybrid results are informative, because they suggest that 
the CCCTT site must adopt a B-form helical conformation m 
order to be cleaved. RNA and DNA polynucleotide chains 
adopt different conformations within an RNA-DNA hybrid, 
with the RNA strand retaining the A- form helical 
conformation (as found in dsRNA) while the DNA strand 
adopts a conformation that is neither strictly A nor B, but 
is instead intermediate in character between these two 
forms (20, 21). Vaccinia topoisomerase makes contacts with 
the nucleotide bases of the CCCTT site in the major groove 
(9, 22). It also makes contacts with specific phosphates 
of the CCCTT site (23). Adoption by the CCCTT site of a 
non-B conformation may weaken or preclude these contacts. 
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The finding that vaccinia topoisomerase is relatively 
insensitive to the nucleotide sugar composition downstream 
of the scissile phosphate implies that the conformation of 
the helix in this portion of the ligand is not important 
for site recognition or reaction chemistry. Topoisomerase 
cleaves DNA-p-RNA strands in which the leaving strand is 
RNA. Indeed, the extent of cleavage at equilibrium is 
significantly higher than that achieved on a DNA- p- DNA 
strand. 

Strand Transfer to RNA . 

The increase in the cleavage-religat ion equilibrium 
constant K eq (= Ki / K*. ) on the DNA-p-RNA substrate can be 
explained by the finding that the rate of single- turnover 
RNA religation k^ ytR::L is about one- tenth o& llD & 
Nonetheless, the extent of religation to RNA is quite high, 
i.e., -90% of the input CCCTT strand is religated to an 19- 
mer RNA acceptor strand in a 2 min reaction. It is shown 
that a CCCTT-containing DNA strand can be rapidly joined by 
topoisomerase to a transcript synthesized in vitro by 
bacteriophage RNA polymerase; -30% of the RNA is 
transferred to the DNA strand in a 2-5 min reaction. This 
property: can be exploited to 5' tag any RNA for which the 
5' terminal RNA sequence is known, i.e., by designing a 
suicide DNA cleavage substrate for vaccinia topoisomerase 
in which the nonscissile strand is complementary to the 5 f 
sequence of the intended RNA acceptor. Some practical 
applications include: (i) 32 P-labeling- of the 5' end of RNA 
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and (ii) affinity labeling the 5' end of RNA, e.g., by 
using a biotinylated topoisomerase cleavage substrate. A 
potential avantage of . topoisomerase-mediated RNA strand 
joining (compared with the standards RNA ligase reaction) 
is that ligation by topoisomerase can be targeted by the 
investigator to RNAs of interest within a complex mixture 
of RNA molecules. 

Frame-Shif t and Missense Mu t^n^ 

It was reported earlier that vaccinia topoisomerase can 
religate to complementary DNA acceptors containing recessed 
ends or. extra nucleotides, thereby generating the 
equivalent of frame-shift mutations (5). similar reactions 
have been described by Henningfeld and Hecht (19) for the 
cellular type I topoisomerase. A key question is whether 
these aberrant religation reactions are robust enough to 
implicate topoisomerase as a potential mutagen in vivo. The 
kinetic analysis suggests that they are and provides the 
first clue as zo what spectrum of frame-shift reactions are 
most likely to occur (taking into account only the 
intrinsic properties of the topoisomerase). For the 
vaccinia enzyme, the hierarchy of frame-shift generate 
religation reactions is as follows: + 1 insertion > -i 
deletion > + 2 insertion » - 2 deletion. 



The slowest of these topoisomerase catalyzed reacti 
strand closure across a 2-nucleotide gap (initial rat 
0.002% of input DNA religated/sec, . In this situation, the 
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attacking nucleophile is held in place at some distance 
from the DNA-protein phosphodiester by base-pairing to the 
nonscissile strand. Moving the 5' hydroxyl one base-pair 
closer to the phosphodiester enhances reaction rate by a 
factor of 100. Extra on-paired nucleotides appear to pose 
much less of an impediment to strand joining to form 1- or 
2 nucleotide insertions. The active site of the 

topoisomerase may be able to. accommodate extrahelical 
nucleotides; alternatively these nucleotides may 
intercalate into the DNA helix at the topoisomerase-induced 
nick. 



15 



There are two potential pathways for topoisomerase to form 
minus frame-shifts in vivo, which differ as to how the 
acceptor , strand is generated: (i) the 5' end of the leaving 
strand can be trimmed by a nuclease, after which ligation 
could occur across the resulting gap; or (ii) a homologous 
DNA single strand attacks the covaler.t intermediate. The 
second pathway presumably requires a helicase in order to 
form the invading strand (and perhaps also to displace the 
leaving strand) . In' the case of plus frame-shifts, only 
the latter pathway would be available to the topoisomerase, 
i.e., because no mechanism exists to add nucleotides to the 
5' terminus of the original leaving strand. No matter 
which pathway is taken, it is reasonable to assume that the 
25 most rapidly catalyzed mutagenic strand- j oining reactions 

are the ones most likely to make their mark in vivo. If 
the reiigation reaction is slow, as for -2 frame-shifting, 



20 
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topoisomerase . This could entail: (i) excision of a patch 
of the DNA strand to which the topoisomerase is bound; or 
(ii) hydrolysis of the topoisomerase-DNA adduct. An enzyme 
that catalyzes the latter reaction was discovered recently 
by Yang et al. (24) . 

Introducing a base mismatch at the -1 position immediately 
flanking the scissile phosphate has almost no effect on the 
rate of religation.. This result is in stark contrast to 
the 10" : rate effect of a 1-nucleotide gap. It is inferred 
that the -1 base mismatches do not significantly alter the 
proximity of the 5' -hydroxyl nucleophile of the terminal 
nucleotide to the scissile phosphate at enzyme's active 
site. The results indicate clearly that topoisomerase has 
the capacity to generate missense mutations in vitro. The 
single-strand invasion pathway involved above for frame- 
shift mutagenesis could, in principle, provide the 
opportunity for topoisomerase to create missense mutations 
in vivo. The kinetics of ligation in vitro suggest that 
topoisomerase-generated missense mutations would 
predominate over frame-shifts. 

The Kinetic Contribution of Base Complementarity 

Kinetic analysis of intramolecular hairpin formation by the 
vaccinia topoisomerase provides the first quantitative 
assessment of the role of base complementarity in strand 
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closure. The rate constant for attack on the DNA- (3' - 
phosphotyrosyl ) bond by a non-pairing 1 3-nucleot ide single 
strand linked in cis to the covalent complex was 5.7 x 10' ; 
sec 'K Altering only the terminal bases of the single- 

5 strand tail to allow base-pairing at the -1, -2, and -3 

positions increased the rate constant for hairpin formation 
by 350-fold. The rate of religation in cis with 3 
potential base-pairs was nearly the same as the rate of 
religation to a non-covalently linked acceptor strand that 

10 forms 18 base pairs 3' of the scissile bond. The ability 

of the covalently bound enzyme to take up and rapidly 
rejoin DNA strands with only three complementary 
nucleotides lends credence to the suggestion that vaccinia 
topoisomerase catalyzes the formation of recombination 

15 intermediates in vivo (25), either via strand invasion or 

by reciprocal strand transfer between two topoisomerase-DNA 
complexes . 

Generation of Gene Sequences 

The use of a DNA-tagged RNA to clone gene sequences was 
20 evaluated using 96 base test RNA fragment of known sequence. 

(GGG AGA CCC AAG CTC GCC CGG TTC TTT TTG TCA AGA CCG ACC 
TGT CCG GTG CCC TGA ATG AAC TGC AGG ACG AGG CAG CGC GGC TAT 
CGT GGC TGG) . This test RNA was synthesized using a T7 
Invitrotranscription kit from Ambion Co. using protocols 
25 supplied by the manufacturer. 

A topoisomerase-DNA intermediate was generated as follows: 
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T0P0P3) were added tc the beads and heated -to 70° C for 5 
minutes, then cooled on ice for 2 minutes. The beads were 
then washed twice with 25 pi each of NEB fll buffer (New 
England Biolabs - iOmM Bis Tris Propane-HCl, lOmM 
10 MgC12, lmMDTT pH7.0 @ 25°) to remove any unannealed 

oligonucleotides. The oligonucleotides were synthesized by 
Dalton Biochemicals (Canada) and had the following 
sequences: 

TOPOB1 - 5' B-GTTTTGGCTCCCATATACGACTCGCCCTTNTTCCGATAGTG 
15 TOPOP2 - 5' -NAAGGGCGAGTC 

TOPOP3 -. 5' -CACTATCGGAA. 

The 5' end of TOPOB1 was biotinylated by using a 
biotinylated guanine nucleotide during that round of 
automated synthesis. 

After the annealing step, the DNA substrate was modified 
using vaccinia topoisomerase basically as previously 
described. Approximately 2.5 g of vaccinia Topoisomerase 
1 was added to the beads in 25 pi of IX NEB #1 buffer. 
This mixture was placed on a rotating wheel for 5 minutes 
at room temperature then washed three times with 25 ul of 



20 



25 
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IX NEB #1 buffer. Approximately 100~200ng of the 96mer RNA 
was added to the washed topoisomerase-DNA intermediate 
bound beads in 10 pi, then 15 pi of 0.5 M NaCl (final cone. 
0.3 M) was added, and the tube was rotated for 5 minutes at 
5 room temperature. 

The DNA-tagged RNA bound beads were next washed twice with 
IX RT buffer (cDNA Cycle Kit, Invitrogen, Carlsbad, CA, 
cat. # L1310-01), primed with RT96 (synthesis of first 
strand) and PCR performed using the cDNA Cycle Kit 
10 according to the manufacturer's instructions and primers 

PCR96 and PCR53. 

RT96 - 5' -CCACGATAGCCGCGCT 

PCR96 - CGTCCTGCAGTTCATTCAG 

PCR53 - G G C T C C CAT AT AC G AC T C 

15 The reaction cycles were as follows: 2 minutes at 94 C C, 

then 25 - 35 cycles (10 sec/cycle) 94°C, 55°C and 72°C, 
followed by 5 minutes at 72°C. The resulting amplified cDNA 
was inserted into a plasmid vector using a TOPO™TA cloning 
Kit (Invitrogen, Carlsbad, CA, cat. KK4500-01) used 

20 according to the manufacturer's instructions. 

While the foregoing has been presented with reference to 
particular embodiments of the invention, it will be 
appreciated by those skilled in the art that changes in 
these embodiments may be made without departing from the 
25 principles and spirit of the invention, the scope of which 
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is defined by the appended claims 
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What is claimed is 

1. A method of covalently joining a DNA strand to an RNA 
strand comprising: 

(a) forming a topoisomerase-DNA intermediate by 
5 incubating a DNA cleavage substrate comprising a 

topoisomerase cleavage site with a topoisomerase 
specific for that site, wherein the 
topoisomerase-DNA intermediate has one or more 5' 
single-strand tails; and 

adding to the topoisomerase-DNA intermediate an 
acceptor RNA strand complementary to the 5' 
single-strand tail under conditions permitting a 
ligation of the covalently bound DNA strand of 
the topoisomerase-DNA intermediate to the RNA 
acceptor strand and dissociation of the 
topoisomerase, thereby covalently joining the DNA 
strand to the RNA strand. 



10 



15 



(b! 



2. A method of claim 1, wherein the DNA cleavage 
substrate is created by hybridising a DNA strand 
20 having a topoisomerase cleavage site to a 

complementary DNA strand, thereby forming a DNA 
cleavage substrate having a topoisomerase cleavage 
site and a oligonucleotide leaving group located 3' of 
a scissile bond. 



25 3. A method of claim 1, wherein the DNA cleavage 

substrate is a plasmid vector comprising a 
topoisomerase cleavage site. 

4. The method of claim 1, wherein the topoisomerase 
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cleavage site is a. sequence comprising CCCTT . 

5. The method of claim 1, wherein the topoisomerase is a 
vaccinia topoisomerase enzyme. 

6. The method of claim 1, wherein the DNA strand 
5 comprising a topoisomerase cleavage site is 

radiolabelled. 

7. The method of claim 6, wherein the radiolabel is V P or 
a radiohalogen . 

8. The method of claim 1, wherein the DNA strand having 
10 a topoisomerase clea ,ge site is labeled with a biotin 

moiety . 

9. The method of claim 1, wherein the topoisomerase-bound 
DNA intermediate and the acceptor RNA strand are 
ligated in vitro . 

15 10. A topoisomerase-DNA intermediate molecule comprising 

one or more 5' single-strand tails. 

11. The topoisomerase-DNA intermediate molecule of claim 
1C, wherein the 5' single-strand tail comprises a 
specific sequence. 

20 12. A topoisomerase-DNA intermediate molecule comprising 

a 5' single-strand tail generated by seep (a) of the 
method of claim 1 . 

13. A topoisomerase-DNA intermediate molecule comprising 
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a 5' single-strand tail generated by steps (a) of the 
method of claim 1, wherein the 5 f single-strand tail 
comprises a specific sequence. 

14. A topoisomerase-QNA intermediate molecule comprising 
a 5' single-strand tail generated by steps (a) of the 
method of claim 1, wherein the DNA strand is 
radio label led . 



15. The topoisomerase-DNA intermediate molecule of claim 
13, wherein the radiolabel is 3 *P or a radiohalogen . 

16. A topoisomerase-DNA intermediate molecule comprising 
a 5 f single-strand tail generated by steps (a) of the 
method of claim 1, wherein the DNA strand is affinity 
labeled. 



17. The topoisomerase-DNA intermediate molecule of claim 
16, wherein the affinity label is a biot in moiety , a 
chitin binding domain or a glutathione-S- t ransf erase 
moiety. 

18. A DNA-RNA molecule covalently joined by topoisomerase 
catalysis. 

19. A DNA-RNA molecule covalently joined by the method of 
claim 1 . 

20. The covalently joined DNA-RNA molecule of claim 19, 
having a 5' end label. 

21. The covalently joined DNA-RNA molecule of claim 20, 
wherein the 5' end label is 32 P or a radiohalogen. 
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22. The covalently joined DNA-RNA molecule of claim 20, 
wherein the 5' end label is a biotin moiety, a chitin 
binding domain, or a glutathione-S- trans f erase moiety. 

23. A covalently joined DNA-RNA molecule having a labeled 
5' end. 

24. The covalently joined DNA-RNA molecule of claim 23, 
wherein the 5' end label is 3:: P or a radiohalogen . 

25. The covalently joined DNA-RNA molecule of claim 23, 
wherein the 5' end label is a biotin moiety, a chitin 
binding domain, or a glutathione-S- trans f erase moiety. 

26. A method of tagging a 5' end of an RNA molecule 
comprising: 

(a) forming a topoisomerase-DNA intermediate by 
incubating a DNA cleavage substrate comprising a 
topoisomerase cleavage site with a topoisomerase 
specific for that site, wherein the 
topoisomerase-DNA intermediate has one or more 5' 
single-strand tails; and 

(b) adding to the topoisomerase-DNA intermediate a 
5'-hydroxyl terminated RNA molecule complementary 
to the 5' single-strand tail under conditions 
permitting a ligation of the covalently bound DNA 
strand of the topoisomerase -DNA intermediate to 
the RNA molecule and dissociation of the 
topoisomerase, thereby forming a 5' end tagged 
DNA-RNA ligation product. 

27. A method of claim 26, wherein the 5' -hydroxyl 
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terminated RNA molecule is the product of in vitro 
synthesis or isolation from cells or tissues. 

28. The method of claim 27, wherein the RNA molecule is 
dephosphorylated after synthesis or isolation. 

5 29. The method of claim 28, wherein the dephosphorylat ion 

is achieved by treatment of the RNA molecule with . 
alkaline phosphatase . 

30. A method of claim 26, wherein the DNA cleavage 
substrate is created by hybridizing a DNA strand 
10 having a topoisomerase cleavage site to a 

complementary DNA strand, thereby forming a DNA 
cleavage substrate having a topoisomerase cleavage 
site and a oligonucleotide leaving group located 3' of 
a scissile bond. 

15 31. The method of claim 26, wherein the topoisomerase is 

a vaccinia topoisomerase enzyme. 

32. The method of claim 26, wherein the cleavage sice 
comprises CCCTT . 

33. The method of claim 26, wherein the DNA comprises a 5' 
20 - end label . 

34. The method of claim 33, wherein the 5' end label is a 
biotin moiety, a chitin binding domain, or a 
glutathione-S-transf erase moiety. 

35. The method of claim 33, further comprising 
25 immobilizing the 5' end labeled DNA on a solid support 

prior to the addition of the 5' -hydroxyl terminated 
RNA molecule . 

36. The method of claim 35, wherein the solid support 
comprises streptavidin, avidin, chitin or glutathione. 



NSDOCID: <WO 9856943A1JB> 



10 



15 



20 



25 



PCT/US98/12372 

-72- 

37. The method of claim 35, further comprising, purifying 
a biotinylated 5' end tagged DNA-RNA ligation product 
by separating the solid support to which the 5' end 
labeled DNA-RNA ligation product is immobilized from 
a liquid phase comprising unmodified RNA. 

38. A 5' en"d tagged RNA molecule. 

39. The 5' end tagged RNA molecule of claim 38, wherein 
the tag is a DNA sequence. 

40. The 5' end tagged RNA molecule of claim 39, further 
comprising a 5' end label. 

41. The 5' end tagged RNA molecule of claim 41, wherein 
the label is J P or a radiohalogen . 



42 



The 5' end tagged RNA molecule of claim 43, wherein 
the label is a biotin moiety, a chitin. binding domain, 
or a glutathione-S-transferase moiety. 

43. A 5' end tagged RNA molecule generated by the method 
of claim 26. 

44. A DNA-RNA molecule which has been joined in vitro by 
the use of a topoisomerase . 

45. a method of obtaining full-length gene sequences 
comprising: 

(a) isolating full-length mRNA; 

(b) attaching a DNA tag sequence to the isolated 
mRNA; and 

(O synthesizing cDNA using the tagged mRNA as a 
template. 

46. A method of claim 45, wherein the mRNA is isolated by 
employing an affinity purification material. 

47. A method of claim 46, wherein the mRNA to be isolated 
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comprises an affinity purification tagged cap 
structure. 

48. A method of claim 46, wherein the affinity 
purification tag is a biotin moiety, a chitin binding 

5 domain or a glutathione-S- transferase moiety. 

49. A method of claim 46, wherein the affinity 
purification material comprises a solid support 
complexed with phenylboronic acid, st reptavidin , 
avidin, chitin or glutathione. 

10 50. A method of claim 49, wherein the solid support is 

magnetic beads cr sepharose. 

51. A method of claim 45 wherein the mRNA is isolated from 
plant cells or animal cells. 

52. A method of claim 51 wherein the animal cells are 
15 mammalian cells or insect cells. 

53. A method of claim 45, wherein the mRNA is decapped and 
dephosphorylated after isolation. 

54.. A method of claim 53 wherein the mRNA is decapped 
enzymatically or by .chemical treatment. 

20 55. A method of claim 54 wherein the enzyme is a 

pyrophosphatase . 

56. A method of claim 54 wherein the chemical treatment is 
periodate oxidation and beta elimination. 

57. A method of claim 53 wherein the mRNA is 
25 . dephosphorylated using alkaline phosphatase. 

58. A method of claim 45, wherein the DNA tag sequence 
comprises a recognition site for a type I 
topoisomerase . 
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A method of claim 58 wherein the DNA tag sequence 
further comprises a recognition site for a site- 
specific restriction endonuclease . 

A method of claim 58 wherein the type I topoisomerase 
is vaccinia DMA topoisomerase. 

A method of claim 58 wherein the DNA tag sequence 
comprises the double stranded sequence shown in Figure 
11 wherein N represents an adenosine moiety, a 
guanosine moiety, a cytosine moiety or a thymidine 
mciety. 

A method of claim 61 wherein N is 1 . to 4 nucleotide 
bases. 

A method of claim 61 wherein vaccinia DMA 
topoisomerase is covalently bound to the double 
s-randed tag sequence. 

A method of claim 45 further comprising amplifying the 
synthesized cDNA wherein the amplification primers 
comprise an anti-coding sequence of the tag sequence 
and a gene specific sequence (3' ) 



f Z. f \ 



A method of claim 64 further comprising inserting the 
amplified cDNA into an expression vector. 

A method of claim 45 wherein the DMA tag. sequence is 
a linearized expression vector. 

Ar. isolated full-length gene sequence prepared by the 
method of claim 45. 

A nucleic acid construct comprising an isolated full- 
length gene sequence prepared of the method of claim 
4 5 and an expression vector. 



\ nucleic acid construct of claim 68 wherein the 
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expression vector comprises one or more elements 
selected from: a promoter-enhancer sequence, a 
selection marker sequence, an origin of replication, 
an epitope-tag encoding sequence or an affinity 
5 purification-tag encoding sequence. 

70. A nucleic acid construct of claim 69 wherein the 
promoter-enhancer sequence is the T7 promoter, gall 
promoter, metallothionein promoter, AraC promoter, or 
CMV promoter-enhancer. 

10 .71. A nucleic acid construct of claim 69 wherein the 

selection marker sequence encodes an antibiotic 
resistance gene. 

72. A nucleic acid construct of claim 69 wherein the 
epitope-tag sequence encodes VS, the peptide Phe-His- 

15 His-Thr-Thr, hemaglut inin, or glutathione-S- 

transferase. 

73. A nucleic acid construct of claim 69 wherein the 
affinity purification-tag sequence encodes a polyamino 
acid sequence or a polypeptide. 

20 74. A nucleic acid construct of claim 73 wherein said 

polyamino acid sequence is polyhist idine . 

75. A nucleic acid construct of claim 73 wherein said 
polypeptide is chitin binding domain or glutathione-S- 
transferase. 

25 76. A nucleic acid construct of claim 73 wherein said 

polypeptide encoding sequence includes an intein 
encoding sequence . 

77. A nucleic acid construct of claim 68 wherein the 
expression vector is a eukaryotic expression vector or 
30 a prokaryotic expression vector. 



SOOCID: <WO 9856943A1 J8> 



WO 98/56943 

PCT/US98/12372 



10 



15 



20 



78 



-76- 



A nucleic acid construct of claim 77 wherein th- 
eukaryotic expression vector is pYES2 , pMT, plND, or 
PCDNA3.1. 

79. A method of obtaininc, full-l enqth gene sequences 

comprising: 

(a) isolating full-length mRNA by employing an 
affinity purification material;- 

(b) decapping and dephosphorylating the isolated 
mRNA; 

(O attaching a DMA tag sequence to the decapped, 
depnosphorylated mRNA, wherein the tag sequent 
comprises the sequence shown in Figure 11 and i s 
attached by vaccinia DNA topoisomerase; 



(dj 



synthesizing cDNA using the tagged mRNA as a 
template; 



<e> amplifying the synthesized cDNA, wherein the 
amplification primers comprise an anti-coding 
sequence of the tag sequence (5') and a gene 
specific sequence (3'); and 

(f» inserting the amplified cDNA into- an expression 
vector. 
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